PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification:
C12N 15/00, C12P 21/02 
C12Q 1/68, G01N 33/68 



(11) International Publication Number: WO 89/09273



(43) International Publication Date: 5 October 1989 (05.10.89) 



(21) International Application Number: PCT/US89/01213
(22) International Filing Date: 22 March 1989 (22.03.89)
(31) Priority Application Number: 171,634
(32) Priority Date: 22 March 1988 (22.03.88)
(33) Priority Country: US



(60) Parent Application or Grant
(63) Related by Continuation
     US 171,634 (CIP)
     Filed on 22 March 1988 (22.03.88)



(71) Applicant (for all designated States except US): BOARD OF
REGENTS, THE UNIVERSITY OF TEXAS SYSTEM
[US/US]; 201 West Seventh Street, Austin, TX 78701 (US).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): SONTHEIMER, Richard,
D. [US/US]; 2252 Valley Mill, Carrollton, TX 75006 (US).
LIEU, Tsu-San [US/US]; 7626 LaRisa, Dallas, TX 75248
(US). CAPRA, J., Donald [US/US]; 3939 Duchess Circle,
Dallas, TX 75229 (US). McCAULIFFE, Daniel, P. [US/US];
3129 Myra Lane, Farmers Branch, TX 75234 (US).



(74) Agent: PARKER, David, L.; Arnold, White & Durkee, P.O.
Box 4433, Houston, TX 77210 (US). 

(81) Designated States: AT, AT (European patent), AU, BB, BE
(European patent), BF (OAPI patent), BG, BJ (OAPI patent),
BR, CF (OAPI patent), CG (OAPI patent), CH, CH
(European patent), CM (OAPI patent), DE, DE (European
patent), DK, FI, FR (European patent), GA (OAPI patent),
GB, GB (European patent), HU, IT (European patent), JP,
KP, KR, LK, LU, LU (European patent), MC, MG, ML
(OAPI patent), MR (OAPI patent), MW, NL, NL (European
patent), NO, RO, SD, SE, SE (European patent), SN
(OAPI patent), SU, TD (OAPI patent), TG (OAPI patent),
US.



Published 

With international search report. 

Before the expiration of the time limit for amending the claims 
and to be republished in the event of the receipt of amend- 
ments. 



(54) Title: METHODS AND COMPOSITIONS INCORPORATING AUTOIMMUNE ANTIGENIC EPITOPES 



[Figure: 60 kD Ro polypeptide (NH2 to COOH); V8 cleavage yielding 23 kD and 37 kD fragments; 24 AA and 10 AA sequences; 72-base and 30-base synthetic oligonucleotides.]

(57) Abstract 

The present disclosure relates to DNA sequences encoding one or more antigenic epitopes of the Ro 60 kD autoanti- 
gen, as well as to antigenic peptides themselves which correspond antigenically to epitopes found on the Ro/SS-A ribonu- 
cleoprotein (RNP) particle. Peptides which incorporate the antigenic epitopic core sequences disclosed herein may be em- 
ployed in place of the Ro/SS-A RNP in any of a variety of immunoassays including ELISA assays. The polypeptides of 
the invention may be employed in colorimetric assays for the identification and characterization of autoimmune diseases 
such as systemic lupus erythematosus (SLE) and Sjogren's syndrome. The DNA sequences disclosed herein may be em- 
ployed in the preparation of the 60 kD Ro antigen, peptides which incorporate antigenic core sequences thereof, to probe 
for Ro sequences by hybridization analysis, and the like. 
Methods and compositions incorporating autoimmune
antigen epitopes. 



BACKGROUND OF THE INVENTION 
The Government may own certain rights in the present
invention pursuant to NIH grants 12127, AR19101, AR07341
and/or AR01784.

Reference is hereby made under 35 U.S.C. § 120 to co-
pending application serial number 171,634, filed March 22,
1988.
1. Field of the Invention

The present invention relates generally to peptides 
bearing selected antigenic epitopes and to nucleic acid 
compositions which may be employed in the preparation of 
such peptides, or to detect the presence of complementary 
nucleic acid sequences in biological samples. More 
particularly, the invention concerns peptides bearing 
antigenic epitopes corresponding to epitopes on the 60 kD 
polypeptide of the Ro/SS-A antigen (also referred to
simply as the "Ro" antigen), and to DNA sequences encoding
all or a portion of the 60 kD polypeptide. The invention 
further relates to processes incorporating the foregoing 
peptides and/or nucleic acid sequences, such as in the 
immunocharacterization of various autoimmune diseases or 
in the preparation of recombinant Ro antigen. 

2. Description of the Related Art

Patients with rheumatic diseases can make autoantibodies
to a variety of biological compounds derived from their
own cells, including autoantibodies to products secreted
by cells (e.g., rheumatoid factor to immunoglobulins),
constituents of the plasma membrane (e.g., phospholipids,
insulin receptors), as well as an array of intracellular
components. Interestingly, of the more than 10,000
macromolecules found inside a cell, only about 30 are
targets of autoantibody production (1). However,
autoimmune antibodies having immunospecificity for one or
more of these targets are found in a wide array of
rheumatic diseases, including systemic lupus erythematosus
(SLE), mixed connective tissue disease (MCTD), primary
sicca syndrome, polymyositis, dermatomyositis, progressive
systemic sclerosis (PSS), rheumatoid arthritis (RA),
idiopathic thrombocytopenic purpura (ITP), primary biliary
cirrhosis (PBC), chronic active hepatitis (CAH), and a
variety of others.

Accordingly, the presence of autoantibodies in the serum
of a patient is generally indicative of one or another of
the various foregoing conditions. For example,
autoantibodies to double-stranded (ds) DNA occur in 50-70%
of SLE patients and are highly specific for this disease
(2). This autoantibody is also occasionally seen in
clinical settings where SLE overlaps with other rheumatic
diseases (e.g., mixed connective tissue disease).
Moreover, circulating ds-DNA antibody levels fluctuate with
systemic disease activity, particularly renal involvement
(3-4), and this autoantibody specificity has been
implicated in the more aggressive forms of lupus nephritis
(5-6).

The presence of one or more of a variety of other
autoantibodies has been used as an indicator of rheumatic
disease, including antibodies having specificity for
chromatin structural proteins such as histones or
nucleosomal structures, or antibodies to ribonucleoprotein
particles (RNP) such as nRNP, U1 snRNP and Sm antigens.
For example, studies employing solid phase immunoassays
have shown that anti-histone antibodies can be found in
about 50% of the sera of unselected SLE patients (7-9),
and in approximately 80% of patients with active disease
(9).

Antibodies to Ro/SS-A and La/SS-B RNPs were probably
first detected in 1958 in the sera of patients with
Sjogren's syndrome, employing extracts of salivary tissue
as antigens (10). Later studies demonstrated two major
specificities in salivary tissue, designated SjD and SjT
(11), probably corresponding to Ro/SS-A and La/SS-B,
respectively. Reichlin and Harley have offered several
recent comprehensive reviews of the clinical correlations
of the Ro/SS-A and La/SS-B antigen-antibody systems
(12-13).

Most interest has centered on the Ro/SS-A system, since
an autoantibody response to this antigen is much more
common than one to La/SS-B. In addition, an autoantibody
response to La/SS-B is almost invariably associated with
anti-Ro/SS-A antibody production. Anti-Ro/SS-A
autoantibodies occur in the highest prevalence in
Sjogren's syndrome (SS) patients. Moreover, some
investigators have suggested that, with the use of a
sufficiently sensitive solid phase assay employing
purified Ro/SS-A antigen, virtually all SS patients
produce this autoantibody (14).



Unfortunately, although purified Ro/SS-A antigen can be
employed to immunodiagnose various autoimmune disorders,
and particularly Sjogren's syndrome, there are significant
problems associated with its use. Of principal importance
is the fact that although Ro/SS-A RNP particles can be
isolated to some degree of purity (15), to do so is
economically impractical. This is due principally to the
high cost and time of isolating Ro/SS-A antigen proteins
from natural sources. Moreover, in that the Ro/SS-A
antigen is an RNP particle, it generally has stricter
requirements for storage and is more difficult to prepare
for commercial distribution. While a recombinant version
of one or more protein subcomponents would prove useful in
this regard, their development has not been reported.
Clearly there is a need for such a recombinant version,
including a need for DNA segments which can be employed in
the preparation of peptides containing Ro antigenic
sequences, which would provide a means for producing
improved antigenic materials which may be recognized by
antisera having specificity for autoimmune antigens such
as Ro/SS-A.



SUMMARY OF THE INVENTION 

Accordingly, the invention concerns methods and
compositions which may be employed to prepare an improved
autoimmune antigenic material that addresses at least some
of the disadvantages in the art.

The invention further concerns antigenic material
that may be employed in the immunoidentification of
autoimmune diseases, and particularly autoimmune diseases
such as Sjogren's syndrome, lupus erythematosus and
similar or related disorders.

More particularly, the invention concerns relatively 
small peptides which may be substituted for RNP antigens 
such as Ro/SS-A in immunoassays, yet which may be readily
prepared synthetically, and easily stored for extended
periods of time. 

The present invention addresses these concerns in
certain embodiments through the provision of DNA sequences
which encode antigenic epitopes, referred to herein as
epitopic core sequences, of the 60 kD Ro antigen peptide,
or biologically functional equivalents thereof. Most
conveniently, one will desire to employ the 60 kD Ro
antigen-encoding sequences of the invention to produce
essentially the complete, natural 60 kD protein by
recombinant means. However, for certain applications, the
preparation of shorter transcriptional units encoding
epitopic core sequence(s) might prove desirable.
Therefore, as used herein, the phrase "60 kD antigen", and
variations thereof, is intended to refer generally to
proteins and peptides bearing the antigenic sequences,
including epitopic core sequences, as set forth in the
present disclosure. It is specifically pointed out that
the use of the "60 kD" designation is intended as a
reference to the natural 60 kD Ro antigen, as a shorthand
or "coined" term for referring to the subject matter of
the invention, and is not in any way intended to set forth
or imply that the invention is limited to the preparation
of proteins exhibiting this particular molecular weight.

This aspect of the invention arises out of the
inventors' successful cloning of DNA encoding the 60 kD Ro
antigen. From knowledge of this 60 kD Ro antigen DNA
sequence as disclosed herein, one is enabled to prepare
DNA sequences which encode a peptide which includes at
least an epitopic core sequence of the 60 kD Ro antigen,
or which encode the full length antigen itself. Moreover,
from knowledge of the biological interchangeability of
various amino acids, which allows one to alter amino acid
structures without altering their underlying biological
functional activity, one is enabled by the present
disclosure [...] embodiments, so-called biologically
functional equivalents, as discussed below.

The invention is thus concerned in particular aspects
with DNA sequences which encode the 60 kD Ro antigen
identified as having an amino acid sequence essentially as
set forth in Figure 2. This 60 kD Ro antigen is one of
two Ro antigens found to have a molecular weight of about
60 kD when subjected to SDS-polyacrylamide gel
electrophoresis. The 60 kD Ro antigen of the present
invention has been found to contain epitopic core
sequences which can be used in the preparation of proteins
and peptides useful in the immunological detection and
identification of various autoimmune diseases, including
diseases such as systemic lupus erythematosus (SLE) and
Sjogren's syndrome. The 60 kD Ro antigen, or antigenic
peptides containing 60 kD Ro antigen epitopes, can thus be
employed directly in immunoassays such as those designed
to detect the presence of cross-reacting antibodies in
clinical samples.

Sequence information obtained for the 60 kD protein
and disclosed herein, whether it be amino acid or nucleic
acid sequence information, can be used to construct
synthetic peptides in accordance with the invention. This
can be achieved either by chemical synthesis means, such
as peptide synthesis, or through the use of recombinant
techniques, for example, through the construction of
recombinant hosts which express proteins or peptides in
accordance with the invention. Thus, through information
provided herein, one can construct DNA sequences which
encode epitopic core sequences derived from the 60 kD Ro
antigen, or one can construct such antigens directly. In
any event, nucleic acid sequences encoding the 60 kD Ro
antigen, or epitopic core sequences therefrom, are
important aspects of the present invention.

Where recombinant techniques are employed to produce
peptides in accordance with the invention, it will be
appreciated by those of skill in the art in light of the
present disclosure that it may be advantageous to prepare
recombinant vectors, such as plasmids, bacteriophage,
viruses, etc., which incorporate recombinant DNA sequences
encoding the desired amino acid sequences. The
preparation and use of such vectors which incorporate the
appropriate recombinant DNA segments will be apparent to
those of skill in light of the disclosure herein and in
light of techniques well known in the art. As used
herein, the term "recombinant DNA segment" or "insert"
means any DNA segment or fragment that is inserted into a
recombinant vector either for the purpose of replicating
or for expressing the recombinant fragment in a
recombinant host to produce a desired peptide.



The preparation and use of recombinant DNA segments
in the practice of the invention offers many advantages,
including, among others, the ability to construct DNA
segments which encode peptides having sequences which
generally correspond to Ro antigen sequences as disclosed
herein, yet which have one or more sequence modifications.
This is referred to in the art generally as the ability to
prepare so-called "second generation" structures. The
sequence characterization of the 60 kD Ro antigen provided
herein enables the preparation of such second generation
structures, e.g., through the practice of specific DNA
mutagenesis techniques, which are now well known in the
art.



Of course, where one desires to prepare and isolate
DNA segments which encode the 60 kD antigen, or antigenic
subportions thereof, the nucleic acid sequence of Figure 2
will find particular utility in the preparation of nucleic
acid hybridization probes. Such probes are useful in the
identification and selection of recombinant clones bearing
the desired sequences. Because hybridization probes, to
be useful, must be of sufficient length to form a
relatively stable hybrid duplex with the target nucleic
acid, one will desire to prepare probes having a length
that is chosen to maximize duplex stability. It is
generally believed that such probes, whether DNA or RNA in
nature, should be at least about 14 or so nucleotides in
length, and more preferably about 18, or even 22 or so,
nucleotides in length. Such minimum probe lengths are
preferred in order to ensure stable duplex formation under
the selected hybridization conditions, as well as to
minimize the possibility of cross hybridization to
unrelated sequences. As will be appreciated, preferred
hybridization conditions for such purposes will generally
be more stringent conditions, such as hybridization in
about 6 x SSC at about 42° C for about 18 or so hours,
followed by washing with 1 x SSC for 2 hours at 42° C,
depending on the probe or primer length being used. These
conditions serve to minimize undesirable cross- and
non-specific hybridization.
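
The probe-length guidance above can be illustrated with a short sketch. It assumes a made-up target string (the Figure 2 sequence is not reproduced in this section), and the function names are hypothetical. It only enumerates candidate complementary probes of a chosen length; it does not model duplex stability or hybridization stringency.

```python
# Illustrative sketch only: the example target is invented and is NOT the
# Figure 2 sequence; all names here are hypothetical.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}

MIN_PROBE_LENGTH = 14  # lower bound of "about 14 or so nucleotides"


def reverse_complement(seq: str) -> str:
    """Return the strand that would base-pair with `seq`."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def candidate_probes(target: str, length: int = 18) -> list[str]:
    """List every probe of `length` nucleotides complementary to `target`."""
    if length < MIN_PROBE_LENGTH:
        raise ValueError("probe shorter than about 14 nucleotides")
    target = target.upper()
    return [
        reverse_complement(target[i:i + length])
        for i in range(len(target) - length + 1)
    ]


example_target = "ATGGCTAAAGAACAGTTCCTGGATGGT"  # hypothetical 27-mer
probes = candidate_probes(example_target, length=18)
print(len(probes), "candidate 18-mers; first:", probes[0])
```

Raising `length` to 22 narrows the candidate list and, in line with the text, would be expected to favor more stable duplexes.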

30 

Accordingly, for embodiments directed to the
preparation and use of nucleic acid segments such as the
foregoing, the invention may be defined in particular
aspects as being directed to substantially purified
nucleic acid segments which correspond to, or are
complementary to, at least a 14 nucleotide long region of
the DNA sequence of Figure 2. In more preferred aspects,
such segments may be defined as corresponding to, or being
complementary to, at least an 18, or even 22, nucleotide
long region of the DNA sequence of Figure 2. As used
herein, the term "substantially purified" is intended to
refer to DNA segments isolated free of their natural state
as they may be present in the genome of an organism, and
is intended to include such segments as they would exist,
e.g., upon genetic engineering such as by insertion into a
recombinant vector.



The invention is directed in certain embodiments to a
method for identifying the presence of a nucleic acid
sequence which encodes at least a portion of the 60 kD Ro
antigen, or a biologically functional equivalent thereof,
in a biological sample suspected of containing such a
sequence. The method of this aspect of the invention
involves generally the steps of 1) incubating nucleic
acids which may be present in the biological sample with a
60 kD Ro antigen DNA segment disclosed herein under
conditions appropriate for the formation of specific
hybrids; and 2) detecting the formation of specific
hybrids between the nucleic acids and the segment by means
of a label, wherein the formation of such a duplex is
indicative of the presence of such a nucleic acid sequence
in the biological sample. The term "biological sample" is
thus intended to refer broadly to any sample containing,
or thought to contain, biological genetic material, and
includes, e.g., a recombinant host cell colony or even
isolated DNA samples.

Antigens of the invention may be defined as including
polypeptides of a relatively short length, which
cross-react immunologically with antisera reactive against
the 60 kD protein of the Ro/SS-A antigen. Such
polypeptides have been shown by the inventors to be useful
in the identification of anti-Ro/SS-A antibodies in
clinical samples, and it is proposed generally that the
polypeptides described herein will prove useful in the
same ways that one may employ the Ro/SS-A antigen itself,
for example, in a variety of immunological techniques
including both competitive and non-competitive
immunoassays.



The present invention is accordingly directed in
particular embodiments to peptides which incorporate amino
acid sequences discovered by the inventors to correspond
to antigenic epitope(s) of the Ro/SS-A antigen. An aspect
of the invention is predicated at least in part on a
realization by the inventors that a simple peptide
sequence can be employed, e.g., in immunoassays, in place
of the natural Ro/SS-A ribonucleoprotein complex itself.

As noted above, certain embodiments of the invention
relate to DNA sequences which encode antigenic sequences
of amino acids that include within their sequence an
epitopic "core" sequence, as well as to the antigenic
peptides themselves. An epitopic core sequence, as used
herein, is a relatively short stretch of amino acids that
is "complementary" to, and therefore will bind, antigen
binding sites on anti-Ro antibodies (i.e., anti-Ro/SS-A
antibodies). It will be understood that in the context of
the present disclosure, the term "complementary", when
used in connection with amino acid sequences, refers to
amino acids or peptides that exhibit an attractive force
towards each other. Thus, epitope core sequences of the
present invention may be operationally defined in terms of
their ability to compete with or even displace the binding
of Ro/SS-A antigen with anti-Ro/SS-A antisera. With
respect to nucleic acid sequences, the term
"complementary" sequences refers to sequences having
sufficient complementarity to allow specific cross-
hybridization of nucleic acid strands.

The size of the encoded polypeptide antigen is not
believed to be particularly crucial, so long as it is at
least large enough to carry the identified epitope core
sequence or sequences. The smallest core sequence of the
present disclosure is on the order of about 13 amino acids
in length. Thus, this size will generally correspond to
the smallest peptide antigens prepared in accordance with
the invention. Of course, the size of the antigen may be
larger where desired, so long as it contains the basic
epitopic core sequence.



Certain embodiments of the invention are directed to
the recombinant production of Ro antigenic proteins or
peptides. In general, these embodiments concern methods
for the preparation of a peptide which includes at least
an epitopic core sequence of the 60 kD Ro antigen, or a
biologically functional equivalent thereof. These methods
include generally 1) preparing a recombinant vector which
incorporates the desired Ro antigen-encoding DNA sequence;
2) translationally expressing the recombinant vector in an
appropriate host so as to obtain expression of the Ro
antigen-encoding sequences; and 3) collecting the peptide
so produced.
In certain embodiments, antigens of the invention may
be defined in terms of polypeptides which include within
their sequence the following sequences of amino acids,
discovered by the present inventors to comprise Ro/SS-A
epitopic core sequences:

-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-
asp/ser-arg- (see Figure 2, amino acids 24 to 36);

-asn-ser-gln-val-glu-ser-gly-ser-leu-glu-asp-asp-
trp-asp-phe-leu-pro-pro-lys-lys-ile-lys- (amino acids
188 to 209); and

-his-ile-pro-asp-pro-asp-ala-lys-lys-pro-glu-asp-
trp-asp-glu- (amino acids 241-255);

or biologically functional equivalent amino acids. [As an
example of biological functional equivalents,
experimentation has indicated to the inventors that the
amino acid at position 12 (Fig. 2 amino acid no. 35) of
the first of the foregoing epitopic core sequences can be
either an "asp" or a "ser" residue.]

As noted, these polypeptides will typically have an
overall length ranging from about 13 to about 25 amino
acids. Thus, the lower size limit for the peptide will
correspond to about the size of the epitope itself, about
13 amino acids, whereas the upper size of the peptide
antigen will be about 25. Peptides much larger than this
are generally undesirable for a variety of reasons,
including, e.g., added difficulty in synthesis (if
synthesized), changes in solubility properties, or
inadvertent addition of undesirable epitopes.
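
As a consistency check on the three core sequences and the stated size range, the sketch below counts residues and compares each count with the Figure 2 positions quoted in the text. The sequences are transcribed from this section; the helper names are hypothetical, and the first core is written with "asp" at position 12 (the text allows "asp" or "ser").

```python
# Core sequences transcribed from the text, keyed by (first, last) Figure 2
# amino acid positions. Helper names are illustrative, not from the source.
EPITOPIC_CORES = {
    (24, 36): "lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-asp-arg",
    (188, 209): "asn-ser-gln-val-glu-ser-gly-ser-leu-glu-asp-asp-"
                "trp-asp-phe-leu-pro-pro-lys-lys-ile-lys",
    (241, 255): "his-ile-pro-asp-pro-asp-ala-lys-lys-pro-glu-asp-"
                "trp-asp-glu",
}


def residues(peptide: str) -> list[str]:
    """Split a hyphen-separated three-letter peptide into residues."""
    return [r for r in peptide.split("-") if r]


def span_length(first: int, last: int) -> int:
    """Number of residues in an inclusive position range."""
    return last - first + 1


def within_preferred_size(peptide: str, low: int = 13, high: int = 25) -> bool:
    """True if the peptide falls in the 'about 13 to about 25' range."""
    return low <= len(residues(peptide)) <= high


for (first, last), core in EPITOPIC_CORES.items():
    n = len(residues(core))
    print(f"{first}-{last}: {n} residues, "
          f"matches span: {n == span_length(first, last)}, "
          f"within 13-25: {within_preferred_size(core)}")
```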



In more preferred embodiments, the peptide antigen
will be selected from the group of peptides consisting
essentially of:

-phe-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-
asp/ser-arg- ;

-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-
asp/ser-arg-trp-ile-glu-ser- ;

-asn-ser-gln-val-glu-ser-gly-ser-leu-glu-asp-asp-
trp-asp-phe-leu-pro-pro-lys-lys-ile-lys- ;

-his-ile-pro-asp-pro-asp-ala-lys-lys-pro-glu-asp-
trp-asp-glu- ;

or biologically functional equivalent amino acids, or
larger peptides which incorporate these sequences or their
functional equivalents. As will be appreciated, the
foregoing sequences include within their sequences the
basic epitopic core sequences discussed above.



As used herein, the phrase "biologically functional
equivalent" amino acids refers to the fact that the
invention contemplates that changes may be made in certain
of the foregoing basic amino acid sequence(s) without
necessarily reducing or losing their antigenic identity.
For example, the sequence can be altered through
considerations based on similarity in charge (e.g.,
acidity or basicity of the amino acid side group),
hydropathic index, or amphipathic score. In general, these
broader aspects of the invention are founded in part on
the general understanding in the art that certain amino
acids may be substituted for other, like amino acids
without appreciable loss of the peptide's ability to bind
to the antibodies, and thus be recognized antigenically.
Exemplary amino acid substitutions are set forth
hereinbelow.

In exemplary embodiments, it is proposed that
biologically functional equivalents of one of the
foregoing epitopic core sequences can be identified by the
formula:

-[AA]1-13-

wherein:

AA1  = lys, arg, gln, or asn;
AA2  = glu, asp, gln, or asn;
AA3  = gln, asn, arg, or asp;
AA4  = phe, tyr, or trp;
AA5  = leu, ile, or val;
AA6  = asp, glu, asn, or gln;
AA7  = gly, ala, or thr;
AA8  = asp, glu, asn, or gln;
AA9  = gly, ala, or thr;
AA10 = trp, tyr, or phe;
AA11 = thr, ser, or gly;
AA12 = asp, glu, asn, gln, ser, thr, ala or gly;
AA13 = arg, lys, asn, or gln.



The foregoing exemplary embodiment demonstrates
possible biologically functional equivalents of the 13
amino acid long epitopic core sequence discussed above.
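
The substitution formula can be read as a per-position membership test. The sketch below is a minimal illustration with hypothetical names: it encodes the thirteen allowed-residue sets from the listing above and checks a candidate 13-residue peptide against them. The listing for position 7 is incomplete in the source; "gly" is assumed there because it is the residue at that position in the core sequence. Passing this check says only that a peptide fits the formula, not that it has been shown to bind anti-Ro/SS-A antibodies.

```python
import math

# Allowed residues per position, transcribed from the formula in the text.
# Names are illustrative; "gly" at position 7 is an assumption (see lead-in).
ALLOWED = [
    {"lys", "arg", "gln", "asn"},   # AA1
    {"glu", "asp", "gln", "asn"},   # AA2
    {"gln", "asn", "arg", "asp"},   # AA3
    {"phe", "tyr", "trp"},          # AA4
    {"leu", "ile", "val"},          # AA5
    {"asp", "glu", "asn", "gln"},   # AA6
    {"gly", "ala", "thr"},          # AA7 (gly assumed from the core sequence)
    {"asp", "glu", "asn", "gln"},   # AA8
    {"gly", "ala", "thr"},          # AA9
    {"trp", "tyr", "phe"},          # AA10
    {"thr", "ser", "gly"},          # AA11
    {"asp", "glu", "asn", "gln", "ser", "thr", "ala", "gly"},  # AA12
    {"arg", "lys", "asn", "gln"},   # AA13
]

CORE = ["lys", "glu", "gln", "phe", "leu", "asp", "gly",
        "asp", "gly", "trp", "thr", "asp", "arg"]


def fits_formula(peptide: list[str]) -> bool:
    """True if every residue is in the allowed set for its position."""
    return len(peptide) == len(ALLOWED) and all(
        residue in allowed for residue, allowed in zip(peptide, ALLOWED)
    )


def variant_count() -> int:
    """Number of distinct 13-mers the formula admits."""
    return math.prod(len(allowed) for allowed in ALLOWED)


print(fits_formula(CORE), variant_count())
```

The size of the admitted set is one practical reason the text speaks of "possible" equivalents: the formula describes a search space rather than a list of tested peptides.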



It should be appreciated that the present invention
contemplates that similar types of substitutions will
prove useful in preparing biologically functional
equivalents of the remaining epitopic core sequences set
forth herein.

As noted above, certain aspects of the invention
relate to DNA or nucleic acid sequences encoding antigenic
peptides which incorporate antigenic core sequences such
as the foregoing. A particular embodiment of the
invention thus relates to DNA or nucleic acid segments
which encode one or more of the epitopic core peptide
sequences set forth in Figure 2. One such DNA segment
encodes amino acids 23 or 24 through 36 of the amino acid
sequence of Figure 2. Another such segment useful in the
practice of the invention encodes amino acids 23 or 24
through 40 of the amino acid sequence of Figure 2. Still
another such segment encodes amino acids 188 through 209.
These regions of the 60 kD Ro antigen of Figure 2
correspond to the epitopic core sequences discussed above.
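
Because each amino acid is encoded by three nucleotides, the lengths of the DNA segments named above follow directly from the residue ranges. The sketch below works through that arithmetic with hypothetical helper names. The nucleotide positions it returns are counted from the first base of codon 1 of the coding region, which is an assumption; the numbering used in Figure 2 may differ, for example if it includes untranslated sequence.

```python
def coding_length(n_residues: int) -> int:
    """Nucleotides needed to encode n_residues amino acids (3 per codon)."""
    return 3 * n_residues


def codon_span(first_aa: int, last_aa: int) -> tuple[int, int]:
    """1-based inclusive nucleotide range encoding residues first_aa..last_aa.

    Positions are relative to the first base of codon 1 (an assumption;
    not necessarily the numbering of Figure 2).
    """
    return 3 * (first_aa - 1) + 1, 3 * last_aa


for first, last in [(24, 36), (23, 40), (188, 209)]:
    start, end = codon_span(first, last)
    print(f"aa {first}-{last}: nt {start}-{end}, {end - start + 1} bases")
```

The same arithmetic accounts for the front-page figure, where 24- and 10-residue stretches are paired with 72- and 30-base synthetic oligonucleotides.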

In addition to the foregoing sequences, it is
believed that the Ro antigen of Figure 2 includes numerous
other potential antigenic core sequences which may be
similarly employed in the practice of the invention. Such
antigenic core sequences can be identified as relatively
hydrophilic stretches of amino acids along the Figure 2
amino acid sequence, and may be predicted, for example,
through the use of software designed to predict antigenic
amino acid structures. Software, such as Chou-Fasman, has
been employed by the inventors to identify antigenic
regions of peptides by means of a consideration of the
hydrophobicity and/or hydrophilicity of peptidyl
structures. From such an analysis of the Figure 2
antigen, at least 3 or so regions have been identified as
likely containing antigenic core sequences. These
regions, as denoted in Figure 2, correspond generally to
amino acids 24 through 36, amino acids 188 through 209,
and amino acids 241 through 255. Therefore, in certain
aspects, the invention is concerned with nucleic acids
which encode the foregoing core sequences, as well as the
peptides themselves.
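
The idea of locating relatively hydrophilic stretches can be sketched with a sliding-window average. This is not the inventors' software: the example swaps in the published Kyte-Doolittle hydropathy scale purely for illustration, and the window size and function names are assumptions. Lower (more negative) window averages indicate more hydrophilic stretches.

```python
# Kyte-Doolittle hydropathy values keyed by three-letter code. This scale is
# used here for illustration only; it is not the method named in the text.
KYTE_DOOLITTLE = {
    "ala": 1.8, "arg": -4.5, "asn": -3.5, "asp": -3.5, "cys": 2.5,
    "gln": -3.5, "glu": -3.5, "gly": -0.4, "his": -3.2, "ile": 4.5,
    "leu": 3.8, "lys": -3.9, "met": 1.9, "phe": 2.8, "pro": -1.6,
    "ser": -0.8, "thr": -0.7, "trp": -0.9, "tyr": -1.3, "val": 4.2,
}


def window_hydropathy(residues: list[str], window: int = 7) -> list[float]:
    """Mean hydropathy of each consecutive window of `window` residues."""
    if window < 1 or window > len(residues):
        raise ValueError("window must be between 1 and the peptide length")
    scores = [KYTE_DOOLITTLE[r] for r in residues]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def most_hydrophilic_start(residues: list[str], window: int = 7) -> int:
    """0-based start index of the window with the lowest mean hydropathy."""
    profile = window_hydropathy(residues, window)
    return min(range(len(profile)), key=profile.__getitem__)


core = "lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-asp-arg".split("-")
print(window_hydropathy(core), most_hydrophilic_start(core))
```

Applied to a full-length sequence, windows with strongly negative averages would be candidate regions to examine further; antigenicity itself would still need to be confirmed experimentally.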

Of course, the antigenic peptides of the invention
will generally find their greatest utility in assays which
require the selection and/or identification of antibodies
having reactivity with Ro/SS-A RNP antigens, such as in
the context of ELISAs, RIAs, or even Western blot
analyses. In a general sense, these immunologic methods
include methods for testing for the presence of anti-
Ro/SS-A antibodies in a sample, comprising immunologically
testing the sample for antibodies which cross react with
an antigen which includes within its amino acid sequence
one or more of the foregoing Ro/SS-A epitopic core
sequences.



For example, in the context of an ELISA assay for
detecting anti-Ro/SS-A antibodies (or antigens) in a
sample (e.g., a clinical sample which contains
antibodies), antigens of the present invention may be
employed in a number of manners. In so-called
non-competitive assays, antigens in accordance with the
invention may be employed directly to identify
cross-reacting antisera by, e.g., binding Ro/SS-A epitopic
core sequence-containing peptides to a solid matrix,
contacting the surface with the antibody-containing sample
under conditions which are favorable to immunocomplex
formation, washing to remove non-immunocomplexed material,
and detecting any such immunocomplex formation.

Use of antigenic peptides of the invention in the
context of competitive assays such as RIAs or competitive
ELISAs is similarly enabled. In these assays, the Ro/SS-A
epitope-containing peptides may be employed to identify
either Ro/SS-A antigen or its corresponding antibody. For
example, standard curves can be determined experimentally
which plot the correlation between the concentration of
Ro/SS-A epitopic core peptides and immunoreactivity with
anti-Ro/SS-A antisera. By introducing unknown quantities
of antigenic material, for example, from a clinical
sample, one can calculate the amount of antigen contained
therein by the ability of the clinical sample to "compete"
with the control.
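
Reading an unknown off a standard curve can be sketched as piecewise-linear interpolation between calibration points. Everything specific in the example is an assumption made for illustration: the function name, the use of linear interpolation, and above all the calibration numbers, which are invented and are not data from this disclosure. In a competitive format the signal falls as the amount of competing antigen rises, which the made-up standards reflect.

```python
def estimate_concentration(signal: float,
                           standards: list[tuple[float, float]]) -> float:
    """Interpolate a concentration from (concentration, signal) standards.

    Assumes the signal changes monotonically with concentration and uses
    straight-line interpolation between neighboring standards.
    """
    points = sorted(standards, key=lambda p: p[1])  # order by signal
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal outside the calibrated range")
    for (conc_a, sig_a), (conc_b, sig_b) in zip(points, points[1:]):
        if sig_a <= signal <= sig_b:
            if sig_b == sig_a:
                return conc_a
            fraction = (signal - sig_a) / (sig_b - sig_a)
            return conc_a + fraction * (conc_b - conc_a)
    raise ValueError("at least two standards are required")


# Hypothetical calibration points: (competitor concentration, relative signal).
# These values are invented for illustration only.
example_standards = [(0.0, 1.00), (1.0, 0.80), (2.0, 0.60), (4.0, 0.20)]
print(estimate_concentration(0.70, example_standards))
```

A real assay would more likely fit a sigmoidal model to replicate measurements; the linear version is used here only because it keeps the "compete with the control" logic easy to follow.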



Accordingly, particular advantages of the invention
will be realized through the preparation of synthetic
peptides which include the desired epitopic core
sequences. As used herein, the term "synthetic" peptides
or antigens refers to peptides or antigens which are
prepared by means other than by purification of naturally
occurring peptides or compositions. Thus, "synthetic"
peptides include protein sequences prepared by peptide
synthesis or through recombinant production. Thus, the
invention is directed in a broad sense to synthetic
peptides which include the above-described epitopic core
sequences, or their biological functional equivalents,
within their structures. As will be appreciated by those
who practice the present invention, the use of these so-
called "synthetic" peptides in place of naturally
occurring antigens will result in a number of distinct
advantages which are not enabled by use of the naturally
occurring Ro/SS-A RNP antigen.



Further, it is pointed out that the invention is
directed in certain embodiments to immunodetection kits
for detecting Ro/SS-A antigens or antibodies in samples.
Such kits include a metered quantity of an antigenic
peptide in accordance with the invention, together with a
means for detecting immunocomplexes between the antigen
and its antibody component. The term "metered" in the
context of the invention refers to aliquots of antigen
having a predetermined or quantified antigenic reactivity
such that a measured amount of the Ro/SS-A antigenic
material will provide a roughly expected level of
immunoreactivity, such as, e.g., in terms of chromogenic
reactivity in an ELISA assay.

Useful immunocomplex detecting means are generally
well known in the art, and include materials such as an
antibody having specificity for the Ro antigenic
substance, or an anti-Ig having specificity for antibodies
of, e.g., human origin. In either case, the detecting
means would include a label such as a radioactive ligand
or a chromogenic enzyme such as horseradish peroxidase,
alkaline phosphatase, or urease, or even using the
avidin-biotin reaction. In certain embodiments, for
example, in an RIA-directed kit, it is contemplated that
the detecting means may include a label, whether
radioactive, enzymatic or the like, attached directly to
the Ro antigenic material itself. In such embodiments, the
detecting means functions by allowing one to detect or
even quantify the interaction between an antibody and the
labeled antigen, or the ability of the labeled antigen to
compete for immunoadsorption with the antibody. In any
case, the design of kits of the foregoing nature will be
apparent to those of skill in the art in light of the
present disclosure and previous teachings such as U.S.
patents 4,454,233; 4,446,232 or 4,376,110; all
incorporated herein by reference.

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Synthetic oligonucleotide construction.
The native 60 kD Ro polypeptide was purified from a Wil-2
cell extract and subjected to a limited Staph. aureus V8
protease digestion which cleaved the 60 kD polypeptide
into a 23 and 37 kD domain. The amino terminus of each
domain was sequenced and this amino acid sequence
information was converted into the most probable nucleic
acid sequence for the construction of two non-degenerate
synthetic oligonucleotides.

Figure 2. The 1.9 kb Ro cDNA nucleic acid and
encoded amino acid sequence. The 1890 base coding strand
encodes a 417 amino acid polypeptide which includes four
previously determined amino acid sequences (underlined)
from sequencing the native protein and cyanogen bromide
and Staph. aureus V8 cleavage products. The eukaryotic
ribosomal consensus sequence for the initiation of
translation is boxed and the putative polyadenylation
signal is overlined.

Figure 3. A. The Ro cDNA encoded amino acid
sequence. The hydrophobic leader segment is boxed, the
sequences corresponding to two synthetic peptides with
antigenic activity are in parentheses and a putative
nuclear targeting signal is overlined with a broken line.
The two sets of internal sequence duplications are
underlined and three of the PEST,D rich areas are indicated
by overlying dots. The negatively charged amino acids at
the carboxy terminal end are indicated by (-) signs and the
KDEL carboxy-terminal endoplasmic reticulum retention
signal sequence is overlined with stars. B. The two sets
of internal duplications are aligned for ease of
comparison. The numbers represent the amino acid sequence
position.

Figure 4. Chou-Fasman structural and Jameson-Wolf
antigenicity predictions of the Ro polypeptide. The
numbers represent amino acid sequence positions, pleated
lines represent beta sheets, wavy lines represent alpha
helices and directional changes represent turns. The
enclosed areas indicate potential antigenic sites. Also
indicated are the positions corresponding to intron
junction sites, derived from genomic mapping experiments.

Figure 5. Genomic restriction map. Various portions
of the Ro cDNA were radiolabeled and hybridized to
multiple restriction enzyme digests of human genomic DNA
by the Southern technique. The length of each labeled
fragment was determined and a composite restriction map
was thus constructed. The map indicates that this Ro gene
resides within a 6 kb stretch of chromosomal DNA.

Figure 6. Depicted is the genomic configuration of
the 60 kD Ro antigen gene, showing its exons and introns
in terms of relative length. Also depicted are the
locations of promoter elements (PEs), poly A+ adenylation
site (AUUAAA) and GT rich sites along the genomic
sequence.

Figure 7. Promoter elements of the 60 kD Ro antigen
gene.



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Introduction

Prior to the present invention, the immunoanalysis of
autoimmune disease antibodies of the anti-Ro/SS-A type has
required the use of the Ro/SS-A antigen itself, a complex
antigen which includes both RNA and protein substituents
bound together in a ribonucleoprotein (RNP)
particle. Unfortunately, use of the Ro/SS-A RNP particle
in the characterization of this disease is fraught with
difficulties which range from the relative scarcity of the
Ro/SS-A RNP particle and low yields associated with its
isolation from natural sources, to difficulties associated
with the stability of the Ro RNP particle upon storage
under economically and/or commercially reasonable storage
conditions. While the former problems arise in part from
the low level production of the Ro particle in sources
such as human or bovine spleen tissue or human B-cell
lines, the latter problems are likely attributable to the
prevalent nature of ribonucleases and proteases or perhaps
the complex structure of the particle.

The present invention is directed to solving these
and other problems through the surprising discovery by the
inventors that a relatively short peptide sequence can
substitute for the much larger RNA/protein particle
complex in immunocharacterization studies and assays. In
that such sequences can be readily synthesized by a
variety of means, including chemically synthetic or even
recombinant techniques, the peptides are relatively easy
to prepare in economical quantities. Moreover, in that
the antigen so produced is a simple peptide structure
rather than an RNP particle or a large naturally occurring
protein such as the 60 kD protein antigen, storage and use
by a variety of manners is thereby enabled.

The present invention is further directed to nucleic
acid which encodes the 60 kD Ro antigen, and/or which
corresponds to a portion of the antigen gene. This aspect
of the invention arises out of the inventors' preparation,
isolation and subsequent sequencing and analysis of cDNA
sequences corresponding to the 60 kD Ro antigen mRNA.
This information, disclosed herein in Figure 2, allows the
preparation of the 60 kD antigen by recombinant means as
well as enabling the preparation, by means of DNA
engineering techniques such as site-directed mutagenesis,
of so-called second generation peptides which incorporate
desired sequence attributes of the 60 kD sequence, such as
the incorporation of antigenic core sequences, or which
incorporate selected mutations or variations into 60 kD
antigen sequences, e.g., in order to achieve a desired
improvement, such as an improvement in antigenic function.

In addition to their usefulness in the preparation of
peptides, the nucleic acid sequences disclosed herein may
be employed to take advantage of their ability to
hybridize to corresponding 60 kD Ro antigen gene
sequences. Thus, the sequence information of Figure 2
will find utility in a variety of embodiments which take
advantage of this property, e.g., in the screening of
clone banks, both first and second generation banks, in
probing the structure of the 60 kD Ro antigen genomic
gene, and in testing for the presence of 60 kD antigen gene
sequences in biological samples, such as skin or heart
tissue.



Polypeptides of the Invention 

Polypeptides of the invention are defined in their
most basic sense as including one or more of the following
amino acid stretches, which have been identified as a
Ro/SS-A epitopic core sequence:

-phe-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-
asp/ser-arg-;

-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-
asp/ser-arg-trp-ile-glu-ser-;

-asn-ser-gln-val-glu-ser-gly-ser-leu-glu-asp-asp-
trp-asp-phe-leu-pro-pro-lys-lys-ile-lys-;

-his-ile-pro-asp-pro-asp-ala-lys-lys-pro-glu-asp-
trp-asp-glu-;
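For those who wish to handle the foregoing sequences computationally, the following sketch converts the hyphenated three-letter notation into one-letter strings. It rests on the assumption that a token such as "asp/ser" denotes alternative residues at a single position; the function name and dictionary are illustrative only and form no part of the disclosure.

```python
from itertools import product

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "ala": "A", "arg": "R", "asn": "N", "asp": "D", "cys": "C",
    "gln": "Q", "glu": "E", "gly": "G", "his": "H", "ile": "I",
    "leu": "L", "lys": "K", "met": "M", "phe": "F", "pro": "P",
    "ser": "S", "thr": "T", "trp": "W", "tyr": "Y", "val": "V",
}

def expand_core(sequence):
    """Return every one-letter peptide encoded by a hyphenated
    three-letter sequence, where 'x/y' is assumed to mark
    alternative residues at one position."""
    positions = [
        [THREE_TO_ONE[name] for name in token.split("/")]
        for token in sequence.strip("-").split("-")
    ]
    return ["".join(choice) for choice in product(*positions)]

core = "-phe-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-asp/ser-arg-"
variants = expand_core(core)
```

Under the stated assumption, the first core sequence expands to two 14-residue peptides differing only at the penultimate position.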



Synthesis of the foregoing epitopic core sequences,
or peptides which include the foregoing within their
sequence, is readily achieved using conventional synthetic
techniques such as the solid phase method (e.g., through
the use of a commercially available peptide synthesizer
such as an Applied Biosystems Model 430A Peptide
Synthesizer). Peptide antigens synthesized in this manner
may then be aliquoted in predetermined amounts and stored
in conventional manners, such as in aqueous solutions or,
even more preferably, in a powder or lyophilized state
pending use.




In general, due to the relative stability of the
peptides of the invention, they may be readily stored in
aqueous solutions for fairly long periods of time if
desired, e.g., up to six months or more, in virtually any
aqueous solution without appreciable degradation or loss
of antigenic activity. However, where extended aqueous
storage is contemplated it will generally be desirable to
include agents including buffers such as Tris to maintain
a pH of 7.0 to 7.5. Moreover, it may be desirable to
include agents which will inhibit microbial growth, such
as sodium azide. For extended storage in an aqueous state
it will be desirable to store the solutions at 4°C, or
more preferably, frozen.

Of course, where the peptide(s) are stored in a
lyophilized or powdered state, they may be stored
virtually indefinitely, e.g., in metered aliquots that may
be rehydrated with a predetermined amount of water
(preferably distilled) prior to use.

Biological Functional Equivalent Amino Acids

As discussed above, it is generally known in the art
that certain amino acids may be substituted for other
amino acids in a protein structure without appreciable
loss of interactive binding capacity with complementary
structures such as antigen-binding regions of antibodies
(or, e.g., binding sites on receptor molecules). It is
thus hypothesized by the present inventors that various
changes may be made in the sequence of the antigenic
peptides without appreciable loss of their antibody-binding,
or Ro/SS-A antigen competing, activity.

The importance of the hydropathic index of amino
acids in conferring interactive biologic function on a
protein has been discussed generally by Kyte et al. (16),
wherein it is found that certain amino acids may be
substituted for other amino acids having a similar
hydropathic index or score and still retain a similar
biological activity. As displayed in the table below,
amino acids are assigned a hydropathic index on the basis
of their hydrophobicity and charge characteristics. It is
believed that the relative hydropathic character of the
amino acid determines the secondary structure of the
resultant protein, which in turn defines the interaction
of the protein with substrate molecules.



Amino Acid           Hydropathic Index

Isoleucine                  4.5
Valine                      4.2
Leucine                     3.8
Phenylalanine               2.8
Cysteine/cystine            2.5
Methionine                  1.9
Alanine                     1.8
Glycine                    -0.4
Threonine                  -0.7
Tryptophan                 -0.9
Serine                     -0.8
Tyrosine                   -1.3
Proline                    -1.6
Histidine                  -3.2
Glutamic Acid              -3.5
Glutamine                  -3.5
Aspartic Acid              -3.5
Asparagine                 -3.5
Lysine                     -3.9
Arginine                   -4.5



Thus, for example, isoleucine, which has a hydropathic
index of +4.5, can be substituted for valine (+4.2) or
leucine (+3.8), and still obtain a protein having
similar biologic activity. Alternatively, at the other
end of the scale, lysine (-3.9) can be substituted for
arginine (-4.5), and so on.
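The hydropathic comparison just described can be sketched in a few lines. The index values below are those of the foregoing table; the function name and the tolerance of 1.0 index unit are hypothetical choices made only for illustration, and the disclosure does not prescribe any particular threshold.

```python
# Hydropathic index values as listed in the table above.
HYDROPATHY = {
    "Ile": 4.5, "Val": 4.2, "Leu": 3.8, "Phe": 2.8, "Cys": 2.5,
    "Met": 1.9, "Ala": 1.8, "Gly": -0.4, "Thr": -0.7, "Trp": -0.9,
    "Ser": -0.8, "Tyr": -1.3, "Pro": -1.6, "His": -3.2, "Glu": -3.5,
    "Gln": -3.5, "Asp": -3.5, "Asn": -3.5, "Lys": -3.9, "Arg": -4.5,
}

def similar_residues(residue, max_difference=1.0):
    """List residues whose index lies within max_difference of the
    given residue, closest first. The tolerance is an assumption."""
    ref = HYDROPATHY[residue]
    candidates = (
        name for name, score in HYDROPATHY.items()
        if name != residue and abs(score - ref) <= max_difference
    )
    return sorted(candidates, key=lambda name: abs(HYDROPATHY[name] - ref))

near_ile = similar_residues("Ile")
near_lys = similar_residues("Lys")
```

With the assumed tolerance, isoleucine returns valine and leucine, and lysine returns arginine among its neighbors, in keeping with the examples given in the text.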

Accordingly, these amino acid substitutions are
generally based on the relative similarity of R-group
substituents, for example, in terms of size, electrophilic
character, charge, and the like. Substitutions which take
various of the foregoing characteristics into
consideration include the following:



TABLE II

Original Residue     Exemplary Substitutions

Ala                  gly; ser
Arg                  lys
Asn                  gln; his
Asp                  glu
Cys                  ser
Gln                  asn
Glu                  asp
Gly                  ala
His                  asn; gln
Ile                  leu; val
Leu                  ile; val
Lys                  arg; gln
Met                  met; leu
Ser                  thr
Thr                  ser
Trp                  tyr
Tyr                  trp; phe
Val                  ile; leu



Immunoassays

It is proposed that the peptides of the invention
will find their greatest utility in immunoassays for the
detection of Ro/SS-A reactive antibodies. In their most
simple and direct sense, preferred immunoassays of the
invention include enzyme linked immunosorbent assays
(ELISAs), but, as discussed above, utility is clearly not
limited to such assays.



In the preferred ELISA assay, peptides incorporating
the Ro/SS-A epitopic core sequences are immobilized onto a
selected surface, preferably a surface exhibiting a
protein affinity such as the wells of a polystyrene
microtiter plate. After washing to remove incompletely
adsorbed material, one will desire to bind or coat onto the
well a nonspecific protein, such as bovine serum albumin
(BSA) or casein, that is known to be antigenically
neutral with regard to the test antisera. This allows for
blocking of nonspecific adsorption sites on the
immobilizing surface and thus reduces the background
caused by nonspecific binding of antisera onto the
surface.



After binding of antigenic material to the well,
coating with a non-reactive material to reduce background,
and washing to remove unbound material, the immobilizing
surface is contacted with the antisera to be tested in a
manner conducive to immunocomplex formation. Such
conditions preferably include diluting the antisera with
diluents such as BSA, bovine gamma globulin (BGG) and
phosphate buffered saline (PBS)/Tween. These added agents
also tend to assist in the reduction of nonspecific
background. The layered antisera is then allowed to
incubate for from 2 to 4 hours, at temperatures preferably
on the order of 25° to 27°C. Following incubation, the
antisera-contacted surface is washed so as to remove
non-immunocomplexed material. A preferred washing procedure
includes washing with a solution such as PBS/Tween, or
borate buffer.

Following formation of specific immunocomplexes
between the test antisera and the bound antigen, and
subsequent washing, the amount of immunocomplex formation
may be determined by subjecting the complex to a second
antibody having specificity for the first. Of course, in
that the test antisera will typically be human antisera,
the second antibody will preferably be an antibody having
specificity in general for human Ig. To provide a detecting
means, the second antibody will preferably have an
associated enzyme that will generate a color development
upon incubating with an appropriate chromogenic substrate.
Thus, for example, one will desire to contact and incubate
the antisera-bound surface with a peroxidase-conjugated
anti-human IgG for a period of time and under conditions
which favor the development of immunocomplex formation
(e.g., incubation for 2 hours at room temperature in a
PBS-containing solution such as PBS-Tween).

After incubation with the second, enzyme-tagged
antibody, and subsequent washing to remove unbound
material, the amount of label is quantified by incubation
with a chromogenic substrate such as 2,2'-azino-di-(3-
ethyl-benzthiazoline-6-sulfonic acid) [ABTS] and H2O2, in
the case of peroxidase as the enzyme label. Quantification
is then achieved by measuring the degree of color
generation, e.g., using a visible spectra
spectrophotometer.
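One way in which the resulting absorbance readings might be reduced to a reactive/non-reactive call is sketched below. The blank correction and the cutoff rule (mean of the negative controls plus three standard deviations) are assumed conventions supplied only for illustration; they are not taught by the present disclosure, and the well names and readings are hypothetical.

```python
from statistics import mean, stdev

def score_wells(sample_od, blank_od, negative_od, n_sd=3.0):
    """Blank-correct absorbance readings and flag reactive samples.

    The cutoff (mean + n_sd standard deviations of the negative
    controls) is an assumed convention, not part of the disclosure.
    """
    blank = mean(blank_od)
    negatives = [od - blank for od in negative_od]
    cutoff = mean(negatives) + n_sd * stdev(negatives)
    corrected = {name: od - blank for name, od in sample_od.items()}
    calls = {name: (od, od > cutoff) for name, od in corrected.items()}
    return calls, cutoff

# Hypothetical absorbance readings
results, cutoff = score_wells(
    sample_od={"serum_A": 1.25, "serum_B": 0.12},
    blank_od=[0.05, 0.05],
    negative_od=[0.10, 0.12, 0.14],
)
```

Each entry of `results` pairs the blank-corrected reading with a Boolean indicating whether it exceeds the cutoff.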

Nucleic Acid Sequences of the Invention 

Nucleic acid sequences of the invention are defined
in their most basic sense as either 1) nucleic acid
sequences which encode a sequence of amino acids which
includes within its sequence at least an epitopic core
sequence of the amino acid sequence of Figure 2, or 2)
nucleic acid sequences which are capable of forming a
stable hybrid duplex with the 60 kD Ro antigen gene
sequences. Thus, included within category 1) are nucleic
acid sequences which encode peptides or proteins which
incorporate any of the antigenic peptide sequences
discussed above. Of course, due to codon redundancy or
perhaps the desire to incorporate variations, the actual
DNA or RNA sequence constructed and/or otherwise obtained
and employed may vary from that actually found in nature,
and thus may vary from that set forth in Figure 2.



The category 2) nucleic acid sequences will generally
find their greatest use as hybridization probes, e.g., to
detect the presence of corresponding sequences in a
biological sample. For such applications, it will
generally be necessary to prepare probes having a stretch
of nucleotides long enough to form a stable hybrid duplex
with the projected target sequence. For this reason,
category 2) nucleic acids will generally include at least
a 14 nucleotide long region that is complementary to, or
which corresponds to, the 60 kD Ro antigen sequence as set
forth in Figure 2. The reason that such nucleic acid
molecules can either be "complementary to" or
"corresponding to" the Figure 2 sequence is that the
ultimate target nucleic acid, if DNA, will include both
complementary DNA strands, with each complementary strand
being available for probing.
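The 14-nucleotide criterion can be checked mechanically, as in the following sketch. Here "corresponding to" is read as identity with a stretch of the coding strand and "complementary to" as identity with its reverse complement; that reading, the function names and the short test sequence are assumptions made for illustration, and the sequence shown is hypothetical rather than taken from Figure 2.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def longest_shared_stretch(probe, target):
    """Length of the longest contiguous run of probe found in target."""
    probe, target = probe.upper(), target.upper()
    best = 0
    for start in range(len(probe)):
        # Only try to extend beyond the best length found so far.
        end = start + best + 1
        while end <= len(probe) and probe[start:end] in target:
            best = end - start
            end += 1
    return best

def qualifies_as_probe(probe, coding_strand, min_length=14):
    """True if the probe matches either strand over min_length bases."""
    same = longest_shared_stretch(probe, coding_strand)
    comp = longest_shared_stretch(reverse_complement(probe), coding_strand)
    return max(same, comp) >= min_length

# Hypothetical coding-strand fragment (not the Figure 2 sequence)
target = "ATGGAGGAATCTGTAAACCAAATGCAGCC"
antisense_probe = reverse_complement(target[5:22])  # 17-mer
short_probe = "ATGGAGG"                             # 7-mer
```

The sketch tests exact matches only; it does not model mismatches or hybridization stringency.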

To prepare nucleic acid sequences for use in
accordance with the invention, one may desire to employ
either recombinant or synthetic means. Where only short
stretches of DNA are needed, e.g., having a length on the
order of 30 to 40 or so nucleotides, one may desire to
prepare the desired segment(s) synthetically, such as
through the use of readily available DNA synthesizing
technology. Due to a practical limitation on the size of
nucleotides that can readily be prepared synthetically,
such synthetic preparations will generally
find their greatest utility in the preparation of segments
for use as hybridization probes. However, a synthetic
approach should not be ruled out when one seeks to prepare
translational units for use, e.g., in the recombinant
production of antigenic peptides, particularly where the
preparation of only smaller peptides is contemplated.

For certain applications, e.g., where larger nucleic
acid polymers are required, one will generally find it
most advantageous to prepare suitable nucleic acid
polymers by recombinant techniques. The most preferred
approach is cDNA cloning in that a nucleic acid molecule
is obtained having a transcription unit that does not
require RNA splicing of the subsequent RNA transcript.
This, of course, allows one to employ prokaryotic hosts
for recombinant production of antigenic peptides. As is
appreciated in the art, such hosts cannot readily be
employed to produce recombinant peptides where
intron-containing coding sequences such as genomic
sequences are used, due to the inability of the host to
faithfully process the RNA intermediate.

In preferred embodiments, the most desired source of
DNA segments encoding the 60 kD Ro antigen gene will be
directly from the inventors' deposit of biological
material with the ATCC (plasmid pGEM containing Ro cDNA
recombinant insert; deposited with ATCC on 3/21/89 and
accorded accession number 40583). From this deposit,
which includes DNA containing the entire coding sequence
of the 60 kD Ro antigen gene, as well as both 3' and 5'
untranslated regions, one can readily prepare
subfragments and desired restriction fragments, and can
even use it as a starting point for the preparation of
second generation materials, e.g., as a template for
site-directed mutagenesis. However, where one desires to
clone the 60 kD Ro antigen gene de novo, the sequence
information provided by Figure 2 can be employed to
prepare appropriate probes.

In general, cDNA cloning of the 60 kD Ro antigen gene
can be performed through the preparation and screening of
a cDNA clone bank such as disclosed below in the examples.
Due to the finding that they contain ample amounts of 60
kD Ro antigen RNA sequences, a preferred source of
starting poly A+ RNA for use in cDNA construction is the
Wil-2 B-cell line. This line is an Epstein-Barr virus
transformed B-cell line that is readily available from a
variety of sources, such as the Mutant Cell Repository.
However, it is believed that other sources of starting
mRNA can be successfully employed, including, e.g., any
B cell line, such as can be obtained from public cell
repositories like the ATCC. In the Wil-2 line, the 60 kD
Ro antigen mRNA sequences are somewhat rare, on the order
of about 1:10,000 molecules. Therefore, one will
generally desire to prepare a bank having at least about
1 x 10^6 members to ensure the presence of a full length
cDNA transcript.
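The relation between message abundance and bank size can be illustrated with a short calculation. The sketch assumes that each clone in the bank is an independent draw from the mRNA pool, so that the chance of at least one Ro clone in a bank of N members at frequency f is 1 - (1 - f)^N; it does not model what fraction of such clones is full length. The function names are hypothetical.

```python
import math

def expected_clones(frequency, library_size):
    """Expected number of clones of a message present at `frequency`."""
    return frequency * library_size

def probability_at_least_one(frequency, library_size):
    """1 - (1 - f)^N, computed stably via log1p and expm1."""
    return -math.expm1(library_size * math.log1p(-frequency))

def library_size_for(frequency, probability):
    """Smallest N for which 1 - (1 - f)^N >= probability."""
    return math.ceil(math.log1p(-probability) / math.log1p(-frequency))

f = 1e-4                 # about 1 in 10,000 molecules, as stated above
bank = 10**6             # bank size suggested above
n_expected = expected_clones(f, bank)
p_present = probability_at_least_one(f, bank)
n_for_99 = library_size_for(f, 0.99)
```

On these assumptions a bank of 1 x 10^6 members would be expected to hold on the order of 100 Ro clones, which leaves a margin for the recovery of a full length transcript among partial ones.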

Screening of the bank is preferably performed by
oligo probing, and preferably using a nucleotide
hybridization probe selected from a region towards the
amino terminus of the gene, as shown in Figure 2. It may
be further desirable to employ an additional probe which
corresponds to a different portion of the gene, such as a
more internal or even carboxy-terminal region. Conditions
found to work well in the hybridization screening are
disclosed hereinbelow. Preferred hybridization conditions
for probe hybridizations in general will be fairly
stringent conditions of 6 x SSC, at 42°C for 18
hours, followed by washing with 1 x SSC for 2 hours at
42°C.

Once the full length gene, or a desired subportion
thereof, has been obtained by whatever means, recombinant
production of Ro 60 kD antigen sequences is obtained by
emplacing the transcriptional unit upstream of, and under
the control of, a suitable promoter and transforming a
suitable host with the constructed recombinant gene.
Often, the recombinant transcription unit will be packaged
into a recombinant vector, such as a plasmid, phage or
virus, which contains an origin of replication and usually
a selection marker sequence such as an antibiotic
resistance marker. The inventors contemplate that either
prokaryotic or eukaryotic systems can be employed, as
discussed in more detail below. Following transformation
of an appropriate host with the recombinant
transcriptional construct, the transformed host is grown
under conditions selected to promote transcription and
subsequent translation of the recombinant sequences. As
understood by those of skill in the art, the conditions
ultimately selected will generally depend on the nature of
the transcriptional construct and the promoter that is
employed.

A preferred construct for recombinant expression of
60 kD Ro antigen sequences, whether it be the full length
gene or subportions or variants thereof, includes the use
of the bacteriophage T7 RNA polymerase/promoter system, as
described by Tabor et al., or the use of the baculovirus
expression system (polyhedrin promoter) as described by
Summers. The use of both of these systems is well
understood by those of skill in the art of recombinant
expression, as exemplified by references 74 through 76.

After expression of the recombinant 60 kD antigen
sequence, the recombinant protein can be collected by
lysis of the bacteria, and purification is by
fractionation and column chromatographic techniques, e.g.,
as detailed for the isolation of native Ro/SS-A in Example
I below.

Site-Specific Mutagenesis

As noted above, site-specific mutagenesis is a
technique useful in the preparation of individual
peptides, or biologically functional equivalent proteins
or peptides, derived from the 60 kD antigen sequence,
through specific mutagenesis of the underlying DNA. The
technique further provides a ready ability to prepare and
test sequence variants, for example, incorporating one or
more of the foregoing considerations, by introducing one
or more nucleotide sequence changes into the DNA.
Site-specific mutagenesis allows the production of mutants
through the use of specific oligonucleotide sequences
which encode the DNA sequence of the desired mutation, as
well as a sufficient number of adjacent nucleotides, to
provide a primer sequence of sufficient size and sequence
complexity to form a stable duplex on both sides of the
deletion junction being traversed. Typically, a primer of
about 17 to 25 nucleotides in length is preferred, with
about 5 to 10 residues on both sides of the junction of
the sequence being altered.

In general, the technique of site-specific
mutagenesis is well known in the art as exemplified by
publications such as reference 28, incorporated herein by
reference. As will be appreciated, the technique
typically employs a phage vector which exists in both a
single stranded and double stranded form. Typical vectors
useful in site-directed mutagenesis include vectors such
as the M13 phage, for example, as disclosed by reference
29, incorporated herein by reference. These phage are
readily commercially available and their use is generally
well known to those skilled in the art.

In general, site-directed mutagenesis in accordance
herewith is performed by first obtaining a single-stranded
vector which includes within its sequence a DNA sequence
which encodes all or part of the 60 kD Ro antigen. An
oligonucleotide primer bearing the desired mutated
sequence is prepared, generally synthetically, for example
by the method of reference 30. This primer is then
annealed with the single-stranded vector, and subjected
to DNA polymerizing enzymes such as E. coli polymerase I
Klenow fragment, in order to complete the synthesis of the
mutation-bearing strand. Thus a heteroduplex is formed
wherein one strand encodes the original non-mutated
sequence and the second strand bears the desired mutation.
This heteroduplex vector is then used to transform
appropriate cells such as E. coli cells and clones are
selected which include recombinant vectors bearing the
mutated sequence arrangement.

In general, of course, prokaryotes are preferred for
the initial cloning of DNA sequences and constructing the
vectors useful in the invention. For example, E. coli
DH5 or K12 strain 294 (ATCC No. 31446) are particularly
useful. Other microbial strains which may be used include
E. coli strains such as E. coli B, and E. coli X 1776
(ATCC No. 31537). These examples are, of course, intended
to be illustrative rather than limiting.

Prokaryotes may also be used for expression. The
aforementioned strains, as well as E. coli W3110 (F-,
lambda-, prototrophic, ATCC No. 273325), bacilli such as
Bacillus subtilis, or other enterobacteriaceae such as
Salmonella typhimurium or Serratia marcescens, and various
Pseudomonas species may be used.

In general, plasmid vectors containing replicon and
control sequences which are derived from species
compatible with the host cell are used in connection with
these hosts. The vector ordinarily carries a replication
site, as well as marking sequences which are capable of
providing phenotypic selection in transformed cells. For
example, E. coli is typically transformed using pBR 322, a
plasmid derived from an E. coli species (see, e.g., ref.
31). pBR 322 contains genes for ampicillin and
tetracycline resistance and thus provides easy means for
identifying transformed cells. The pBR plasmid, or other
microbial plasmid or phage, must also contain, or be
modified to contain, promoters which can be used by the
microbial organism for expression of its own proteins.

Those promoters most commonly used in recombinant DNA
construction include the β-lactamase (penicillinase) and
lactose promoter systems (32-34) and a tryptophan (trp)
promoter system (35, 36). While these are the most
commonly used, other microbial promoters have been
discovered and utilized, and details concerning their
nucleotide sequences have been published, enabling a
skilled worker to ligate them functionally with plasmid
vectors (36).

In addition to prokaryotes, eukaryotic microbes, such
as yeast cultures, may also be used. Saccharomyces
cerevisiae, or common baker's yeast, is the most commonly
used among eukaryotic microorganisms, although a number of
other strains are commonly available. For expression in
Saccharomyces, the plasmid YRp7, for example, is commonly
used (37-39). This plasmid already contains the trp1 gene
which provides a selection marker for a mutant strain of
yeast lacking the ability to grow in tryptophan, for
example ATCC No. 44076 or PEP4-1 (40). The presence of
the trp1 lesion as a characteristic of the yeast host cell
genome then provides an effective environment for
detecting transformation by growth in the absence of
tryptophan.

Suitable promoting sequences in yeast vectors include
the promoters for 3-phosphoglycerate kinase (41) or other
glycolytic enzymes (42-43), such as enolase,
glyceraldehyde-3-phosphate dehydrogenase, hexokinase,
pyruvate decarboxylase, phosphofructokinase, glucose-6-
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate
kinase, triosephosphate isomerase, phosphoglucose
isomerase, and glucokinase. In constructing suitable
expression plasmids, the termination sequences associated
with these genes are also ligated into the expression
vector 3' of the sequence desired to be expressed to
provide polyadenylation of the mRNA and termination.
Other promoters, which have the additional advantage of
transcription controlled by growth conditions, are the
promoter region for alcohol dehydrogenase 2, isocytochrome
C, acid phosphatase, degradative enzymes associated with
nitrogen metabolism, and the aforementioned
glyceraldehyde-3-phosphate dehydrogenase, and enzymes
responsible for maltose and galactose utilization. Any
plasmid vector containing a yeast-compatible promoter,
origin of replication and termination sequences is
suitable.

In addition to microorganisms, cultures of cells
derived from multicellular organisms may also be used as
hosts. In principle, any such cell culture is workable,
whether from vertebrate or invertebrate culture. However,
interest has been greatest in vertebrate cells, and
propagation of vertebrate cells in culture (tissue
culture) has become a routine procedure in recent years
(44). Examples of such useful host cell lines are VERO
and HeLa cells, Chinese hamster ovary (CHO) cell lines,
and WI38, BHK, COS-7, 293 and MDCK cell lines. Expression
vectors for such cells ordinarily include (if necessary)
an origin of replication, a promoter located in front of
the gene to be expressed, along with any necessary
ribosome binding sites, RNA splice sites, polyadenylation
site, and transcriptional terminator sequences.

For use in mammalian cells, the control functions on
the expression vectors are often provided by viral
material. For example, commonly used promoters are
derived from polyoma, Adenovirus 2, and most frequently
Simian Virus 40 (SV40). The early and late promoters of
SV40 virus are particularly useful because both are
obtained easily from the virus as a fragment which also
contains the SV40 viral origin of replication (45).
Smaller or larger SV40 fragments may also be used,
provided there is included the approximately 250 bp
sequence extending from the Hind III site toward the Bgl I
site located in the viral origin of replication. Further,
it is also possible, and often desirable, to utilize
promoter or control sequences normally associated with the
desired gene sequence, provided such control sequences are
compatible with the host cell systems.



An origin of replication may be provided either by
construction of the vector to include an exogenous origin, 
such as may be derived from SV40 or other viral (e.g., 
Polyoma, Adeno, VSV, BPV) source, or may be provided by 
the host cell chromosomal replication mechanism. If the 
vector is integrated into the host cell chromosome, the 
latter is often sufficient. 

EXAMPLE I 

The present example was undertaken to illustrate a
preferred embodiment of the invention. The example thus
employs laboratory techniques found by the inventors to
work well in the context of the present invention.
However, it will be apparent to those of skill in the art
that various alterations and modifications, including
changes in reagents and amounts of materials, may be made
in these particular techniques in light of the present
disclosure without departing from the spirit and scope of
the invention.



A peptide, designated epitopic core sequence I (ECS-I),
consisting of the following sequence of amino acids
was synthesized with a cysteine residue added at its amino
terminus, using an Applied Biosystems Model 430A Peptide
Synthesizer:

-phe-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-
asp-arg-.

The peptide was then deprotected by HF cleavage. The
peptide was then sequenced as disclosed below to confirm
the sequence. A portion of the synthesized peptide
material was conjugated to keyhole limpet haemocyanin
(KLH).

The following peptide, designated ECS-II, was also
prepared in the above manner:

-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-asp-
arg-trp-ile-glu-ser-.

Protein Sequencing

The above antigen peptide preparations were subjected
to peptide sequencing by automated Edman degradation using
either a gas phase Model 470 Applied Biosystems Sequencer
(Foster City, CA) with the Model 120 on-line HPLC PTH
amino acid identification system, or a Beckman spinning
cup Model 890M sequencer (Palo Alto, CA). In the latter
case, the PTH amino acids were identified using a Nova-Pak
column in a Waters Model 840 HPLC system (Milford, MA).

Antigen source. The Ro/SS-A antigen was isolated
from an extract of an Epstein-Barr virus transformed human
B-lymphoblastoid cell line (Wil-2). The cells were grown
in Eagle's medium supplemented with 2 mM glutamine, sodium
pyruvate, non-essential amino acids, 10% fetal calf serum,
penicillin (10,000 U/ml) and streptomycin (10 mg/ml). The
cells were centrifuged at 35 x g for 12 minutes and washed
with phosphate-buffered saline (0.14M NaCl, 0.01M
phosphate, pH 7.4) (PBS) 3 times. The packed cells were
mixed with the same volume of PBS containing 1 mM
phenylmethylsulfonyl fluoride (PMSF). The suspension was
sonicated on ice with ten 15-second pulses using a Heat
System Sonicator at a setting of 9. The sonicate was then
centrifuged at 12,100 x g for 1 hour and the supernatant
subjected to ammonium sulfate precipitation as described
by Lieu et al. (17).

The ammonium sulfate fractions were pooled and
dialyzed against 24mM borate buffer, pH 1.6, and applied
to a polybuffer ion exchange column. After washing the
column with 24mM borate buffer (eluate with OD280 > 0.1),
a stepwise sodium chloride gradient was applied to the
column. The majority of La/SS-B antigen was eluted in the
0.1M and 0.2M sodium chloride fractions whereas all the
Ro/SS-A antigen was recovered in the 0.5M and 1M sodium
chloride fractions. After concentration by ammonium
sulfate precipitation, a modest amount of La/SS-B activity
was also detected in the 1M NaCl fraction by
counterimmunoelectrophoresis against prototypic antisera.

The antigenically active material from the PBE column
(0.5M and 1M NaCl fraction) was further purified by
electrophoresis in a 5.6% native polyacrylamide gel
(N-PAGE). After electrophoresis, the gel was divided into a
series of 10mm slices and the material was eluted from the
gel with distilled water. By counterimmunoelectrophoresis
(CIE), maximal Ro/SS-A antigen activity was located in the
fraction with an Rf of 9,0 whereas the maximal La/SS-B
activity was recovered in the region with an Rf of 0.7.
When the antigenically active fraction from the N-PAGE was
subjected to SDS-PAGE it contained a single stained band
of 60,000 molecular weight whose identity was confirmed by
Western blot analysis.



Human antisera. Anti-Ro/SS-A sera were selected by
the presence of a single precipitin line in Ouchterlony
analysis having complete identity with prototypic sera
used in studies of Lieu et al. (17). The absence of
antibodies to other nuclear antigens such as La/SS-B,
U1-RNP and Sm was also confirmed by ELISA. Monospecific
anti-Sm and anti-La/SS-B human autoimmune sera were
originally obtained from the laboratory of Dr. E.M. Tan
(Scripps Research Institute, La Jolla, CA). Each formed a
single precipitin line in double immunodiffusion analysis
with the appropriate antigens and did not react with
purified Ro/SS-A in the ELISA.

Immunization Protocol

Female rabbits (New Zealand White) were immunized
subcutaneously in the neck region with 0.5 mg of peptide
conjugated to KLH emulsified in Freund's complete adjuvant
(FCA). At one week, the animals were similarly immunized
with 1 ug of the unconjugated peptide alone. At monthly
intervals thereafter the rabbits were boosted with the
KLH-conjugated peptide emulsified in FCA.

The native Ro/SS-A antigen (5 ug/ml) or synthetic
peptide (10 ug/ml) in PBS was added to wells of a
microtiter plate and incubated 15 hours at 4°C. The
plates were washed with PBS-Tween and the remaining sites
were coated with 1% BSA for 1 hour. After washing 3 times
with PBS-Tween, sera diluted with 1% BSA, 0.5% bovine
gamma globulin (BGG) in PBS-Tween were added and incubated
for 2 hours. The plates were then washed 3 times with
PBS-Tween. Peroxidase conjugated goat-anti-human IgG
diluted 1:3000 in PBS-Tween containing 1% BSA and 0.5% BGG
was added and incubated.
The plates were washed in a similar manner. The color was
developed by adding a peroxidase substrate solution
containing 1 mg/ml of 2,2'-azino-di-(3-ethyl-
benzthiazoline-6-sulfonic acid) (ABTS) and 0.005% H2O2 in
0.1M McIlvaine's buffer, pH 4.6. The optical density was
measured using a Titertek Multiskan brand ELISA plate
reader.

To determine the proportion of the total native
Ro/SS-A antigenic activity that is present on the
synthetic peptides, increasing amounts of KLH-coupled
synthetic peptide were preincubated with a monospecific
patient anti-Ro/SS-A serum. After a 16-hour incubation
at 4°C, the inhibited serum aliquots were diluted to the
appropriate concentrations and added to a native Ro/SS-A
antigen-coated plate. The degree of inhibition was then
calculated by comparison with the reactivity of untreated
anti-Ro/SS-A serum.
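
By way of illustration only, the comparison of inhibited and untreated serum described above may be sketched as follows. The disclosure does not state the formula used, so the expression below (fractional loss of optical density relative to untreated serum) is an assumption, as are the function name percent_inhibition, the optional blank subtraction, and the example readings.

```python
def percent_inhibition(od_inhibited: float, od_untreated: float,
                       od_blank: float = 0.0) -> float:
    """Percent loss of ELISA signal relative to untreated serum.

    Assumed formula (not given in the disclosure):
        100 * (1 - (OD_inhibited - blank) / (OD_untreated - blank))
    """
    signal_untreated = od_untreated - od_blank
    if signal_untreated <= 0:
        raise ValueError("untreated serum must read above the blank")
    signal_inhibited = od_inhibited - od_blank
    return 100.0 * (1.0 - signal_inhibited / signal_untreated)


# Hypothetical OD405 readings, chosen only to exercise the function.
print(percent_inhibition(0.50, 1.00))  # 50.0
```

A blank of zero reproduces the simplest ratio; a non-zero blank may be supplied where background wells were read on the same plate.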
Human autoimmune serum to Ro/SS-A was diluted 1:100
and reacted with both the synthetic peptide ECS-I and
ECS-I conjugated to KLH (Table III). Whereas the binding
to anti-Ro/SS-A serum was quite strong, reaction of the
synthetic peptides with human autoimmune sera specific for
other antigens such as Sm, La/SS-B and normal human sera
did not produce significant binding. Furthermore, the
binding of anti-Ro/SS-A sera to the ECS-I peptide was
completely blocked by preincubating the sera with native
Ro/SS-A antigen. Finally, anti-Ro/SS-A sera which show
strong binding to the ECS-I did not react in ELISA with
control polypeptides such as ribonuclease A or an
unrelated synthetic peptide conjugated to KLH.



TABLE III

REACTIVITY OF ECS-I WITH HUMAN AUTOIMMUNE SERA

Immobilized Antigen      Antibody         ELISA OD405

KLH-Synthetic            Anti-Ro/SS-A     1.823
peptide ECS-I            Anti-Sm          0.288
                         Anti-La/SS-B     0.355
                         Normal serum     0.263

Synthetic peptide        Anti-Ro/SS-A     1.059
ECS-I                    Anti-Sm          0.236
                         Anti-La/SS-B     0.158
                         Normal serum     0.118
Moreover, the ECS peptides demonstrated a strong
ability to compete with the native Ro/SS-A RNP particle
for binding to anti-Ro antisera. Exemplary studies
showing the inhibition of binding of a monospecific human
anti-Ro/SS-A serum to native Ro/SS-A antigen in ELISA by
synthetic peptide ECS-II are shown in Table IV.

TABLE IV

INHIBITION OF THE BINDING OF A MONOSPECIFIC
HUMAN ANTI-RO/SS-A SERUM TO NATIVE Ro/SS-A
ANTIGEN BY KLH-ECS-II

KLH-ECS-II (ug)      Percent Inhibition

 2                   11.9
 6                   17.0
 8                   27.3
12                   28.4
16                   39.2
24                   29.0
30                   32.0


In order to further characterize the Ro/SS-A antigen,
an antiserum to ECS-I was raised by immunizing a rabbit
with KLH-ECS-I. The antibody level in rabbit serum
(diluted 1:100) was measured by ELISA against native human
Ro/SS-A antigen as well as the synthetic peptide.
Elevated binding to Ro/SS-A antigen (OD405 = 0.1565 for
pre-immune rabbit sera) was detected. The binding of
rabbit anti-peptide ECS-I to native human Ro/SS-A antigen
was quantitatively inhibited by KLH-ECS-I (Table V). The
results of these experiments indicate that the epitope
represented by the ECS-I sequences corresponded to an
epitope on the outer surface of native Ro/SS-A. It is
possible, however, that distortion might occur when
coating the native Ro/SS-A protein to the ELISA plate,
allowing the ECS-I epitope to become relatively more
exposed. This is also consistent with the finding that
some prototypic human autoimmune sera to Ro/SS-A antigen
reacted with the synthetic peptide.

TABLE V

ANTISERUM TO NATIVE HUMAN Ro/SS-A ANTIGEN

KLH-ECS-I (ug)       Percent Inhibition

 8                   79
10                   88
16                   98


From the foregoing examples, it will be apparent to
those of skill in the art that the two epitopic core
sequences, ECS-I and ECS-II, comprise useful functional
antigenic equivalents of the native Ro/SS-A RNP particle
antigen. For example, serum from rabbits immunized with
KLH-ECS-I showed antibody activity toward native human
Ro/SS-A antigen, confirming that the ECS-I peptide
possessed the combination of properties which are
essential for antibody binding. The sites most frequently
recognized by antibodies form three-dimensional
superassemblies characterized by high local mobility, convex
surface shapes, and negative electrostatic potential (18).
The sequence data indicate that the ECS peptides carry a
negative charge. Delineation of the exact location of
sub-epitope(s) within the ECS sequences which bind to
human autoantibody and to antibodies raised in rabbits
will be of interest. In this way, the influence of the
microenvironment on antigenic sites within peptides can be
approached. In addition, the antiserum to KLH-ECS will be
useful in determining the tissue distribution as well as
cellular localization of native human Ro/SS-A antigen.
Also, antiserum to the ECS peptides could be used as a
probe for identifying fragments of the
Ro/SS-A molecule following proteolytic cleavage. Such
studies could provide further insight regarding the
structure of the native molecule.
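
The statement that the ECS peptides carry a negative charge can be checked by a simple count of ionizable side chains, sketched below for illustration. The counting rule (asp and glu as -1, lys and arg as +1, termini and the added cysteine ignored) and the function name net_side_chain_charge are assumptions of this sketch; it is not a pKa-based calculation.

```python
ACIDIC = {"asp", "glu"}  # side chains counted as -1 near neutral pH
BASIC = {"lys", "arg"}   # side chains counted as +1 near neutral pH


def net_side_chain_charge(peptide: str) -> int:
    """Count basic minus acidic residues in a hyphen-separated
    three-letter peptide string (termini are ignored)."""
    residues = [r for r in peptide.lower().split("-") if r]
    basic = sum(r in BASIC for r in residues)
    acidic = sum(r in ACIDIC for r in residues)
    return basic - acidic


# Sequences as given for ECS-I and ECS-II in Example I.
ECS_I = "phe-lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-asp-arg"
ECS_II = ("lys-glu-gln-phe-leu-asp-gly-asp-gly-trp-thr-asp-"
          "arg-trp-ile-glu-ser")

print(net_side_chain_charge(ECS_I))   # -2
print(net_side_chain_charge(ECS_II))  # -3
```

Under these assumptions both peptides carry a net negative side-chain charge, in keeping with the text above.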



Relatively few amino acid sequences of "autoantigens"
that react with autoimmune sera from patients with
rheumatic disease have yet been elucidated. The carboxyl
terminal 55 amino acids of the La/SS-B antigen were
recently identified by analyzing overlapping cDNA clones
(19). There is no apparent sequence homology between the
ECSs of the present invention and these sequences of
La/SS-B.

Synthetic peptides have been widely applied as probes
for the study of DNA-binding sites on protein (21), T and
B-cell recognition sites on protein antigens (18, 21-24),
and peptide binding sites on Ia molecules (25). The
results of recent studies using synthetic peptides in
combination with crystallographic studies indicate that
initial binding to solvent exposed amino acid residues may
promote local side-chain displacements and thereby allow
the participation of other, previously buried residues
(26). The fact that synthetic ECS-I reacted with
monospecific antibodies to Ro/SS-A in human autoimmune sera
indicates that one or more primary contact amino acid
residues are located on this peptide. Indeed, data indicate
that the tripeptide, -asp-gly-trp-, is likely involved in
initial antibody recognition.

It is proposed that the use of synthetic peptides in
accordance with the invention will likely provide a
method superior to those currently available for detecting
anti-Ro/SS-A antibodies in the sera of patients with
autoimmune diseases. The advantages provided by the
synthetic peptide include the availability of a quality
controlled antigen in large amounts, the ability to
automate the procedure and the lower background and higher
sensitivity of a synthetic peptide-based ELISA technique.
Present anti-Ro/SS-A assays are complicated by the recent
observation which suggests that some epitopes on the
Ro/SS-A antigen are cross-reactive with IgG (27). The use
of synthetic peptides to mimic Ro/SS-A epitopes could
provide advantages since the two different types of
epitopes (Ro/SS-A only vs. Ro/SS-A plus IgG) could be
separately analyzed. In fact, recent preliminary results
from the inventors' laboratory suggest that sera from
patients with different clinical disorders such as SCLE
and Sjogren's syndrome and children with congenital heart
block show different frequencies of reactivity to the
ECS-I epitope. The idea that variability might exist in
the autoimmune response to different epitopes on the same
molecule is supported by recent studies which have
indicated that rabbits actively immunized against
myohemerythrin also demonstrate a variable antibody
response to different epitopes on this polypeptide (19).
In this example is described the screening of a human
hybridoma cell complementary DNA (cDNA) library and the
isolation of a cDNA clone which encodes a 60 kD Ro
autoantigen. As shown in Figure 2, the 1890 base pair
recombinant insert contained an open reading frame that
encoded a 417 amino acid polypeptide which included the 18
amino acid Ro epitopic core sequence addressed
above in Example I. The coding region begins with an AUG
codon as part of a sequence which bears homology to the
eukaryotic ribosomal consensus sequence for the initiation
of translation. The initial methionine was followed by a
strongly hydrophobic 16 amino acid leader segment. There
was a 66 base pair 5' untranslated segment and a 573 base
pair 3' untranslated segment which begins with a UAG
termination codon and included a single putative
polyadenylation signal. The nucleic acid sequence and its
encoded amino acid sequence did not bear strong homology
to other published sequences. Southern filter
hybridization analysis indicated that this gene was not highly
polymorphic and existed as a single copy in the human
genome. Chromosomal localization studies place this gene
on the short arm of chromosome 19 near the LDL receptor
gene.
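
The segment lengths recited above can be cross-checked arithmetically, as sketched below for illustration. The constants are the figures given in the text; the function names are assumptions of this sketch, and the stop codon is counted within the 3' untranslated segment because the text states that the segment begins with UAG.

```python
CDNA_LENGTH_BP = 1890        # recombinant insert
FIVE_PRIME_UTR_BP = 66       # 5' untranslated segment
THREE_PRIME_UTR_BP = 573     # 3' segment, beginning with the UAG stop codon
ENCODED_RESIDUES = 417       # amino acids in the encoded polypeptide


def coding_length_bp(residues: int) -> int:
    """Bases needed to encode the residues (stop codon excluded)."""
    return 3 * residues


def start_codon_position(five_prime_utr_bp: int) -> int:
    """1-based position of the first base of the AUG codon."""
    return five_prime_utr_bp + 1


total = (FIVE_PRIME_UTR_BP
         + coding_length_bp(ENCODED_RESIDUES)
         + THREE_PRIME_UTR_BP)

print(coding_length_bp(ENCODED_RESIDUES))       # 1251
print(start_codon_position(FIVE_PRIME_UTR_BP))  # 67
print(total == CDNA_LENGTH_BP)                  # True
```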

MATERIALS AND METHODS

The enzymes used in the various recombinant nucleic
acid techniques were obtained from Promega Biotec, Madison,
WI or Pharmacia, Inc., Piscataway, NJ, unless stated
otherwise.
Protein Purification and Sequence Analysis

The Ro protein antigen was purified from the human
Wil-2 cell line (an Epstein-Barr virus transformed
lymphoblastoid B-cell line) as described (46). Staphylococcus
aureus V8 (Boehringer Mannheim Biochemicals, Indianapolis,
IN) and cyanogen bromide (Sigma Chemicals Co., St. Louis,
MO) cleavage fragments were generated (47) and sequenced
on an Applied Biosystems 470A protein sequencer/120A PTH
Analyzer (Applied Biosystems, Foster City, CA), as
previously described (46).

Deglycosylation Analysis

The purified Ro protein was digested with
Neuraminidase, Endo-α-N-Acetylgalactosaminidase and
Glycopeptidase F according to the manufacturer's
recommendations (Boehringer Mannheim Biochemicals), and
then subjected to sodium dodecyl sulfate-polyacrylamide
gel electrophoresis (SDS-PAGE).

Synthetic Oligonucleotide Construction

A codon utilization table was employed to convert the
amino acid sequence data into its most probable nucleic
acid sequence (48). The oligonucleotides were synthesized
on an Applied Biosystems 380B DNA synthesizer.
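
The conversion of amino acid sequence into a single most probable nucleic acid sequence can be sketched as a table lookup, shown below for illustration only. The codon choices in PREFERRED_CODON are hypothetical and are not the codon utilization table of reference (48); the input tripeptide is likewise a stand-in, since the oligonucleotides of this example were derived from the amino terminal sequences of the protease fragments.

```python
# Hypothetical one-codon-per-residue table (assumption, not reference 48).
PREFERRED_CODON = {
    "phe": "TTC", "lys": "AAG", "glu": "GAG", "gln": "CAG",
    "leu": "CTG", "asp": "GAC", "gly": "GGC", "trp": "TGG",
    "thr": "ACC", "arg": "CGC", "ile": "ATC", "ser": "AGC",
}


def reverse_translate(peptide: str) -> str:
    """Map a hyphen-separated three-letter peptide to one
    non-degenerate coding-strand DNA sequence (5' to 3')."""
    residues = [r for r in peptide.lower().split("-") if r]
    return "".join(PREFERRED_CODON[r] for r in residues)


def reverse_complement(dna: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(dna))


coding = reverse_translate("asp-gly-trp")
print(coding)                      # GACGGCTGG
print(reverse_complement(coding))  # CCAGCCGTC
```

Choosing one codon per residue yields a non-degenerate oligonucleotide, at the cost of possible mismatches wherever the gene uses a different synonymous codon.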

cDNA Library Construction

Total RNA was isolated from the Wil-2 cell line by
the guanidinium method and enriched for the polyadenylated
(poly-A) fraction with an oligo(dT)-cellulose column (49).
cDNA was made from the poly-A enriched fraction with the
cDNA Synthesis System (Bethesda Research Laboratories
(BRL), Gaithersburg, MD). The cDNA was dG-tailed with
dGTP and terminal transferase and ligated into similarly
dC-tailed pGEM plasmid DNA with T4 DNA ligase (50). DH5
Escherichia coli competent cells (BRL) were transformed
with the cDNA-pGEM ligation mixture and a cDNA library was
constructed (49). A human hybridoma cDNA library was
similarly constructed.

cDNA Isolation

The synthetic oligonucleotides were radiolabeled and
hybridized with nitrocellulose filters to which the cDNA
containing bacterial colonies had been fixed (49). A
single colony containing a 1.2 kilobase (kb) cDNA insert
was isolated. Later this 1.2 kb cDNA was radiolabeled and
used to screen a human hybridoma cell cDNA library, in
which a single 1.9 kb cDNA was isolated.

cDNA Characterization

Restriction enzyme analysis: The 1.2 kb cDNA was
digested with various restriction enzymes and the
restriction fragments were analyzed by Southern filter
hybridization with the radiolabeled synthetic
oligonucleotides (50).

Sequencing: Several of the cDNA restriction
fragments were electroeluted from a 1% agarose gel and
subcloned into M13mp18 and M13mp19 plasmid vectors
(Boehringer Mannheim Biochemicals) and single stranded DNA
complementary to both strands of cDNA was produced (51).
This DNA was sequenced by the Sanger dideoxy method with
35S ATP and Sequenase according to the manufacturer's
recommendations (United States Biochemical Corp.,
Cleveland, OH).



Northern Filter Hybridization: Total RNA and poly-A
enriched RNA from several human blood cell lines (obtained
as outlined above) were electrophoresed in a 1% agarose-
formaldehyde gel, electrophoretically transferred to
Zeta-Probe nylon reinforced support membrane according to
the manufacturer's guidelines (Bio-Rad Laboratories,
Richmond, CA), hybridized with radiolabeled cDNA and then
washed at 65 degrees Celsius in 0.24 x SSC (1 x SSC is
0.15M NaCl and 0.015M sodium citrate, pH 7.0) and 0.1%
sodium dodecyl sulfate (SDS) (52).
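
The salt concentrations implied by a given dilution of SSC follow directly from the definition of 1 x SSC given above, as sketched below for illustration. The function name ssc_molarity is an assumption of this sketch.

```python
SSC_1X_NACL_M = 0.15      # NaCl in 1 x SSC, as defined in the text
SSC_1X_CITRATE_M = 0.015  # sodium citrate in 1 x SSC


def ssc_molarity(fold: float) -> tuple[float, float]:
    """Return (NaCl, sodium citrate) molarity for an n-fold SSC wash."""
    if fold <= 0:
        raise ValueError("fold must be positive")
    return fold * SSC_1X_NACL_M, fold * SSC_1X_CITRATE_M


# The 0.24 x SSC wash named in the text.
nacl, citrate = ssc_molarity(0.24)
print(round(nacl, 4), round(citrate, 4))
```

For the 0.24 x SSC wash this corresponds to about 0.036M NaCl and 0.0036M sodium citrate; lower salt at a fixed temperature gives a more stringent wash.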

Southern Filter Hybridization: 15 ug of human
genomic DNA was digested with various restriction enzymes,
separated by 0.6% agarose gel electrophoresis and
transferred to a nitrocellulose support membrane where it was
hybridized with radiolabeled cDNA. Washing was performed
at 0.5 x SSC, 0.1% SDS and 65 degrees Celsius (50).

Radiolabeling

Synthetic oligonucleotides were end labeled with
gamma 32P ATP using T4 polynucleotide kinase (50). cDNA
was radiolabeled using the hexamer extension method with
hexamer primers (Pharmacia, Inc.), alpha 32P dCTP and
E. coli DNA polymerase I (Klenow fragment) (50).

Radionuclides were obtained from New England Nuclear
Corp., Boston, MA.

Filters were exposed to Kodak X-OMAT-AR film for an
optimal period of time, and the film was then developed on
a Konica QX-60A film processor.

Chromosomal Localization

Somatic cell hybrid clone panels were formed by
polyethylene glycol mediated fusion of human lymphocytes
to Chinese hamster ovary cell lines that were defective
for various DNA repair capabilities. Cytogenetic analysis
was used to determine the presence or absence of human
chromosomes in each of the hybrid clones so formed.
However, due to frequent human chromosomal alterations in
these clones, the human chromosomes were more definitively
detected by analysis of isoenzyme and DNA markers (53, 54).
Probes for complement component 3 (C3) and low density
lipoprotein receptor (LDLR) were used to identify the
short arm of chromosome 19.

Computer Based Sequence Analysis

The 1.9 kb cDNA nucleic acid sequence and its encoded
amino acid sequence were analyzed for homologies to other
published sequences. This was done with the University of
Wisconsin Computer Genetics Group's Genetics Analysis
software and the FASTA/FASTP programs. The nucleic acid
sequence was compared to the European Molecular Biology
Lab data base-Version 13 (April 1988) and the GenBank
database-Version 56 (July 88). The protein sequence was
compared to the National Biomedical Research Foundation
data base-Version 13 (March 1988) (55).



RESULTS

AMINO ACID SEQUENCING AND SYNTHETIC
OLIGONUCLEOTIDE CONSTRUCTION

A 60 kD protein with Ro antigenic activity was
isolated from the Epstein-Barr virus transformed human
Wil-2 B-cell line and subjected to a limited Staphylococcus
aureus V8 protease digestion. This produced 23 and 37 kD
fragments which were identified by SDS-PAGE. The amino
terminal end of each domain was sequenced and this
sequence information was used to construct two
non-degenerate synthetic oligonucleotides as well
as two different synthetic peptides.


cDNA ISOLATION AND SEQUENCE ANALYSIS

A single 1.2 kb cDNA clone was isolated from the
Wil-2 cell cDNA library. This clone was characterized by
restriction enzyme analysis and sequenced. The cDNA
encoded the previously determined amino acid sequences but
the reading frame was open to the end of the cDNA with no
termination codon, indicating that the cDNA identified a 2
kb RNA species but no 1.2 kb species, confirming that the
cDNA was abbreviated. A human hybridoma cDNA library was
subsequently screened with the 1.2 kb cDNA and a single
1.9 kb cDNA clone was isolated and sequenced. The first
1,238 base pairs of this clone are identical to the entire
sequence of the 1.2 kb clone. The 1.9 kb clone contains
1,890 base pairs which include a single 1,251 base open
reading frame beginning with an AUG start site at position
67 as part of a putative Kozak ribosomal translation
initiation site and ending with the termination codon UAG
(Figure 2) (56). The sequence AUUAAA (Figure 2) is a
putative polyadenylation signal (57), but there is not a
typical poly-A sequence between this signal and the end of
the cDNA sequence, suggesting that this 1.9 kb cDNA may be
of incomplete length.



The encoded polypeptide has a molecular weight (MW)
of about 48 kD which includes a 17 amino acid hydrophobic
leader segment that is not present in the purified
protein. The MW of the encoded polypeptide without the
leader segment is approximately 14 kD less than that of
the native 60 kD protein as measured by SDS-PAGE. The
amino acid sequence contained no potential sites for
N-linked glycosylation and deglycosylation analysis of the
purified 60 kD protein shows no evidence of N or O-linked
glycosylation. A highly negatively charged region is
common to these proteins and may account for an incorrect
SDS-PAGE MW estimation through aberrant gel migration.
The calculated isoelectric point of this polypeptide was
4.14 which closely approximates the value of 4.67 measured
from the native purified protein (58).
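
The relationship among the 48 kD encoded mass, the 17 residue leader and the approximately 14 kD shortfall relative to the 60 kD gel estimate can be checked with a rough calculation, sketched below. The average residue mass of 0.110 kD is a common rule of thumb and an assumption of this sketch, not a value from the disclosure; the function names are likewise hypothetical.

```python
AVG_RESIDUE_MASS_KD = 0.110  # rule-of-thumb average (assumption)


def mature_mass_kd(encoded_mass_kd: float, leader_residues: int) -> float:
    """Encoded mass minus the estimated mass of the leader segment."""
    return encoded_mass_kd - leader_residues * AVG_RESIDUE_MASS_KD


def gel_discrepancy_kd(apparent_mass_kd: float, encoded_mass_kd: float,
                       leader_residues: int) -> float:
    """Apparent SDS-PAGE mass minus the estimated mature mass."""
    return apparent_mass_kd - mature_mass_kd(encoded_mass_kd,
                                             leader_residues)


# Figures from the text: 60 kD by SDS-PAGE, about 48 kD encoded,
# 17 residue leader segment.
print(round(mature_mass_kd(48.0, 17), 1))            # 46.1
print(round(gel_discrepancy_kd(60.0, 48.0, 17), 1))  # 13.9
```

Under this assumption the estimated shortfall is about 13.9 kD, in line with the approximately 14 kD stated above.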

This protein contained two different sets of internal
repeating sequences (Figure 3b) which may have arisen from
internal duplications and be of functional importance.
The first set of duplications has 67% of its nucleic acid
sequence and 73% of its amino acid sequence conserved,
whereas the second set had a 60% nucleic acid and 64%
amino acid sequence conservation. This protein also
contained several regions (Figure 3a) which might allow
rapid degradation as proposed by Rogers et al. for
proteins containing PEST regions: regions rich in proline
(P), glutamic acid (E), serine (S), and/or threonine (T)
and to a lesser extent aspartic acid (D) (59).
There was no striking sequence similarity to other
RNA binding proteins including another recently sequenced
Ro cDNA (60). There was no major homology to the RNP
consensus sequence (61) and no zinc finger (62) or leucine
zipper (63) nucleic acid binding motifs.

A computer based analysis of the nucleic acid
sequence showed no striking homology to other sequences,
but the negatively charged carboxy terminus region had
some minor amino acid sequence homology with a number of
other proteins of diverse origin and function. The most
striking of these homologies was with ubiquinone
cytochrome c reductase residues 50 - 78, where 24 out of 29
residues were a perfect match or an Asp for Glu or Glu for
Asp switch with Ro residues 385 - 413. The
carboxy-terminal sequence Lys-Asp-Glu-Leu (KDEL) followed the
negatively charged region and was identical to the carboxy
signal sequence which has been shown to be crucial for the
retention of several proteins in the endoplasmic reticulum
(64). These other proteins also had a highly negatively
charged region just proximal to the KDEL sequence. The 17
amino acid hydrophobic leader sequence was similar to that
of a number of other precursor proteins and indicated that
this protein may be modified in the endoplasmic reticulum
(65).

Chou-Fasman secondary structure analysis predicted a
complex secondary structure (Figure 4) which included
several helix-turn-helix units (centered at residues 57,
70, 210, 233 and 246), characteristic of some nucleic acid
binding proteins (66). Three of these units were found
within the internal duplications between residues 207 and
300 (Figure 3), one unit per duplication. There were also
several beta sheet-rich areas between residues 1-17,
144-186 and 285-333. The carboxy terminal residues 349-417
were predicted to have an alpha helical array.



Goldman et al. and Kyte-Doolittle hydropathic
analysis predicted a strongly hydrophobic leader segment
and several smaller regions of hydrophobicity, including
an area just proximal to the negatively charged carboxy
terminal residues which could be a membrane spanning
region. This analysis also predicted several strongly
hydrophilic regions, particularly between amino acids
210-300 and 350-417. The latter sequence spanned the
negatively charged carboxy end of the polypeptide.
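
A Kyte-Doolittle hydropathy profile is a sliding-window average of per-residue hydropathy values, and may be sketched as follows. The window length of 7, the function name hydropathy_profile and the choice of the ECS-II sequence as input are assumptions of this sketch; the analysis reported above was run on the full 417 residue polypeptide with parameters that are not stated.

```python
# Kyte-Doolittle hydropathy index, one-letter amino acid codes.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence: str, window: int = 7) -> list[float]:
    """Mean hydropathy of each full window along the sequence.
    Positive values suggest hydrophobic stretches, negative values
    hydrophilic ones."""
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in sequence.upper()]
    return [sum(scores[i:i + window]) / window
            for i in range(len(scores) - window + 1)]


# ECS-II from Example I in one-letter code (illustrative input only).
ECS_II = "KEQFLDGDGWTDRWIES"
profile = hydropathy_profile(ECS_II)
print(len(profile))      # 11
print(max(profile) < 0)  # True
```

Every window of this peptide averages below zero, that is, the sequence is hydrophilic throughout on this scale.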



Jameson-Wolf antigenicity analysis (Figure 4)
predicted the location of several potential epitopes
including the previously characterized epitope at the
amino terminus of this polypeptide (67, 68).



SOUTHERN FILTER HYBRIDIZATION ANALYSIS
AND CHROMOSOMAL LOCALIZATION

Southern filter hybridization of Eco RI digested
genomic DNA from ten normal individuals showed a single
13.5 kb hybridizing fragment. Several other restriction
enzyme digests were similarly analyzed, with the pattern
of bands showing no difference between individuals,
suggesting that the gene is not highly polymorphic and
exists as a single copy. A similar analysis using several
different radiolabeled portions of the 1.9 kb cDNA allowed
the construction of a genomic restriction map as shown in
Figure 5. This Ro gene occupied approximately 6 kb of
genomic DNA, indicating that introns may account for about
4 kilobases of this gene.
Depicted in Figure 6 is a genomic intron/exon map of
the 60 kD Ro antigen gene, showing the relative location
and size of introns 1-8 (I1 - I8), exons 1-9, gene
regulatory promoter elements ("PEs"), and a
polyadenylation site ("AUUAAA"). Shown in Figure 7 is the
relative location of various of the 60 kD Ro antigen gene
promoter elements, including "GC" and "TATA" and "CCAAT"
sequences. The relative position of the promoter elements
with respect to the "ATG" start codon is also shown.
Table VI below characterizes the exons and introns of the
Ro antigen gene in some particular detail, including some
comparisons to consensus exon/intron sequences. The
information shown in Figures 6 and 7 was derived from
mapping/sequencing experiments involving the Ro cDNA and
genomic DNA.

The Ro cDNA was used for chromosome location by
Southern filter hybridization analysis of Hind III
digested DNA extracted from 38 independently derived human
x Chinese hamster ovary (CHO) somatic cell hybrids that
had randomly segregated human chromosomes. Ro cDNA
hybridized to both human and CHO DNA fragments, and the
resolvable difference in fragment size (human at 19-20 kb
and CHO at 5.7 kb) made it easy to determine the presence
or absence of human genomic DNA among the hybrid clones.
The low discordancy between Ro-hybridizing human sequences
and human chromosome 19 (8%) and the apparent random
association between Ro-hybridizing human sequences and
every other human chromosome (34%-67% discordancy)
suggested a chromosome 19 location of this gene.
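
The discordancy figures quoted above can be understood as the
fraction of hybrid clones in which the hybridization signal and a
given chromosome disagree (one present, the other absent). The
sketch below shows that calculation under this assumption; the
exact scoring rules used for the hybrid panel are not described in
this section, and the function name and clone data are hypothetical.

```python
def discordancy(probe_present, chromosome_present):
    """Fraction of hybrid clones in which probe and chromosome disagree."""
    if len(probe_present) != len(chromosome_present):
        raise ValueError("both lists must describe the same clones")
    if not probe_present:
        raise ValueError("at least one clone is required")
    mismatches = sum(
        1 for p, c in zip(probe_present, chromosome_present) if p != c
    )
    return mismatches / len(probe_present)


if __name__ == "__main__":
    # Hypothetical scoring of 38 clones in which 3 clones disagree.
    probe = [True] * 20 + [False] * 18
    chrom = [True] * 17 + [False] * 21
    print(f"{discordancy(probe, chrom):.0%}")
```

A chromosome carrying the gene should show a discordancy near zero,
whereas unrelated chromosomes, which segregate independently of the
probe, should show much higher values.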


The markers used to determine the presence or absence
of human chromosome 19 in the hybrids (PEPD and GPI) were
both located on the long arm of this small, slightly
submetacentric chromosome. Therefore, the three clones
(1HL14, 9HL9, and 24HL8) discordant for the chromosome 19
markers and Ro suggest that Ro might be on the short arm
of the chromosome. This hypothesis was tested by
examining those three hybrids for the presence of known
chromosome 19 short arm markers, C3 and LDLR. The results
indicated that Ro was perfectly concordant with LDLR in
this set, clearly placing the gene on the short arm of
chromosome 19. The discordance of LDLR with C3-PEPD-GPI
in 24HL8 is consistent with, and therefore gives hybrid
clone data to support, the linkage data placing LDLR
distal to C3.

The above-described cDNA clone encoded a Ro RNP
autoantigen, a conclusion supported by the fact that the
encoded polypeptide included an amino acid sequence shown
to contain a major Ro epitope (see Example I). Not all Ro
antisera, as defined by conventional immunodiffusion
assays, are reactive to this epitope, however (67,68). It
now appears that there are multiple Ro epitopes, some of
which may be common to several Ro autoantigens whereas
others may be unique to specific Ro autoantigens. Ro
antisera are heterogeneous in recognizing one or more of
the different epitopes (69,70). Whether or not certain
patients with Ro antibodies can be clinically categorized
by which epitopes their sera recognize, and whether or not
this is related to their HLA type, has yet to be
determined. The ability to categorize patients based on Ro
epitope recognition could have great clinical utility if
the patient's clinical course and/or response to therapy
could be predicted by these results.

The molecular weight disparity between that measured
by SDS-PAGE and that calculated from the encoded
polypeptide is not difficult to reconcile in light of similar
discrepancies reported with a number of other proteins
which have a very negatively charged region similar to
this Ro polypeptide. This negatively charged region
apparently is responsible for the retarded gel migration
observed with these proteins.


The absence of a typical poly-A tail at the 3' end of
the 1.9 kb cDNA shown in Figure 2 suggests that, although
it contains the entire coding region, it may be truncated
in its 3' non-coding sequences. This may have arisen from
aberrant cDNA synthesis or from subsequent deletion of the
poly-A tail after cDNA synthesis. Another explanation
would be that the Ro mRNA is not poly-A tailed, like
histone mRNA. However, there is no comparable 3' end
processing signal sequence as found in the histones, and
the 1.9 kb clone does have a poly-A signal sequence.
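
As an illustration of how a poly-A signal sequence can be located
in a cDNA, the sketch below scans a sequence for the canonical
AATAAA hexamer and the common ATTAAA variant (AUUAAA in RNA). It is
a minimal example, not the method used in this work; the function
name and the test sequence are hypothetical, and the sequence is not
taken from Figure 2.

```python
# Hexamers treated as poly-A signals in this sketch (DNA alphabet).
POLYA_SIGNALS = ("AATAAA", "ATTAAA")


def find_polya_signals(cdna, signals=POLYA_SIGNALS):
    """Return (0-based position, hexamer) for each signal found in cdna."""
    seq = cdna.upper().replace("U", "T")  # accept RNA or DNA input
    hits = []
    for i in range(len(seq) - 5):
        hexamer = seq[i:i + 6]
        if hexamer in signals:
            hits.append((i, hexamer))
    return hits


if __name__ == "__main__":
    # Hypothetical 3' non-coding fragment.
    print(find_polya_signals("GGGATTAAACCC"))
```

A signal found near the 3' end of a clone that lacks a poly-A tail
is consistent with truncation of the 3' non-coding sequence
downstream of the signal.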

No major similarities were found between the RNP
consensus sequence and the encoded amino acid sequence of
our cDNA (61). However, the RNP consensus sequence is
not necessarily a requirement nor a universal property of
RNA binding proteins, for it is absent in ribosomal
proteins, in many viral RNA-binding nucleocapsid proteins
and in the Sm-D RNP autoantigen (71,72). The three
duplications between residues 207-255 have a
helix-turn-helix configuration characteristic of some nucleic acid
binding domains and may be a site of RNA binding. It is
also of note that some viral nucleocapsid and envelope
proteins have a limited sequence homology to this stretch
of internal duplications as determined by computer based
protein sequence analysis.

The hydrophobic leader segment of the Ro 60 kD
polypeptide suggests that this protein undergoes
transmembrane transport. This sequence may serve to transport
this protein across the endoplasmic reticulum for
modification; however, there is no evidence of
glycosylation. The KDEL carboxy signal sequence suggests
that this protein may reside in the endoplasmic reticulum.
However, indirect immunofluorescence microscopy on
cultured fibroblasts and Wil-2 cells with human Ro
antisera, which have been shown to react with this
polypeptide, reveals predominantly intranuclear particulate
staining, and the polypeptide contains the sequence
PPKKIKDPD (residues 203-212 in Figure 2), which is very
similar to nuclear targeting signals of other nuclear
proteins (62,63). This signal sequence may facilitate
transport of this protein into the nucleus.


There has been mounting evidence to support a role of
foreign antigens in triggering an inappropriate immune
response against self-antigens through molecular mimicry
(73). An initial computer search for sequence homology to
microbial agents has not been fruitful. As the Ro
epitopes become better defined it may become more apparent
whether microbial agents play a role in the pathogenesis
of this autoimmune response.

The relationship between this Ro protein and the
others is unknown. Whether they are structurally or
functionally related has not been determined, and the RNA
binding properties of each of the Ro proteins have not been
well defined. Ro antisera specific for a 52 kD or a 60 kD
protein have been shown to immunoprecipitate the hY RNAs
from cellular extracts (70). Protein binding to a hY RNA
has also been demonstrated in reconstitution studies with
another recently characterized 60 kD Ro RNP, though the
efficiency of reconstitution was reportedly quite low
(60).

Now that several different proteins with Ro
antigenicity have been identified, including another 60 kD
protein which appears to bind hY RNA, the term Ro (or
SS-A) should probably not be used exclusively for any one of
these proteins. However, to avoid confusion with existing
nomenclature the present inventors have continued to refer
to the 60 kD antigen of the present invention as the "60
kD Ro antigen". Yet, as each of these Ro autoantigens is
further characterized, a system of classification should
evolve so that each gets a more unique designation. The
characterization of the various Ro cDNAs and their encoded
epitopes should be helpful in this regard and also provide
a means to further clarify the functional and pathologic
roles of these protein autoantigens.

*****

It will be apparent to those of skill in the art that
numerous modifications and changes may be made in the
present invention in light of the present disclosure
without departing from the spirit and scope of the
invention. For example, it will be apparent in light of
the present disclosure that numerous approaches can be
taken in expression of the 60 kD Ro coding sequences, or
subfragments thereof, to produce the 60 kD protein or
antigenic fragments. Similarly, for example, in
connection with immunoassays, although the present
inventors have chosen the ELISA system with a peroxidase
enzyme tag, there is no reason why other systems such as
an RIA-based immunoassay system, or the use of different
enzymes, such as alkaline phosphatase, urease or even DNA
capture, cannot be used with equal utility. Numerous
other changes will be equally apparent. It is intended
that all such changes be within the spirit and scope of
the claims which follow.

1. A recombinant vector comprising a recombinant DNA
insert encoding a peptide which includes at least an
epitopic core sequence of 60 kD Ro antigen, or a
biologically functional equivalent thereof.

2. The recombinant vector of claim 1, wherein the 60 kD
Ro antigen is identified as having an amino acid sequence
essentially as set forth in Figure 2.

3. The recombinant vector of claim 2, wherein the
epitopic core sequence comprises amino acids 24 through 36
of Figure 2.

4. The recombinant vector of claim 3, wherein the
epitopic core sequence comprises amino acids 23 through 36
of Figure 2.

5. The recombinant vector of claim 2, wherein the
epitopic core sequence comprises amino acids 188 through
209 of Figure 2.

6. The recombinant vector of claim 2, wherein the
epitopic core sequence comprises amino acids 241 through
255 of Figure 2.




7. The recombinant vector of claim 2, wherein the
recombinant DNA insert encodes a 60 kD Ro antigen having
an amino acid sequence essentially as set forth in Figure
2, or a biologically functional equivalent thereof.

8. The recombinant vector of claim 1, wherein the
recombinant DNA insert encodes a peptide of from about 13
to about 25 amino acids in length.

9. A substantially purified DNA segment comprising a DNA
sequence encoding a peptide which includes at least an
epitopic core sequence of 60 kD Ro antigen, or a
biologically functional equivalent thereof.

10. The DNA segment of claim 9, wherein the 60 kD Ro
antigen is identified as having an amino acid sequence
essentially as set forth in Figure 2.

11. The DNA segment of claim 10, wherein the DNA sequence
encodes a peptide which includes an epitopic core sequence
comprising amino acids 24 through 36 of Figure 2.

12. The DNA segment of claim 11, wherein the DNA sequence
encodes a peptide which includes an epitopic core sequence
comprising amino acids 23 through 36 of Figure 2.




13. The DNA segment of claim 10, wherein the DNA sequence
encodes a peptide which includes an epitopic core sequence
comprising amino acids 188 through 209 of Figure 2.

14. The DNA segment of claim 10, wherein the DNA sequence
encodes a peptide which includes an epitopic core sequence
comprising amino acids 241 through 255 of Figure 2.

15. The DNA segment of claim 10, wherein the DNA sequence
encodes a 60 kD Ro antigen having an amino acid sequence
essentially as set forth in Figure 2, or a biologically
functional equivalent thereof.

16. The DNA segment of claim 9, wherein the DNA sequence
encodes a peptide of from about 13 to about 25 amino acids
in length.

17. A substantially purified nucleic acid segment that
corresponds to, or is complementary to, at least a 14
nucleotide long region of the DNA sequence of Figure 2.

18. The nucleic acid segment of claim 17, wherein the
segment corresponds to, or is complementary to, at least
an 18 nucleotide long region of the DNA sequence of Figure
2.

19. The nucleic acid segment of claim 17, wherein the
segment corresponds to, or is complementary to, at least
a 22 nucleotide long region of the DNA sequence of Figure
2.



20. A method for the preparation of a peptide which
includes at least an epitopic core sequence of 60 kD Ro
antigen, or a biologically functional equivalent thereof,
comprising the steps of:

    (a) preparing a recombinant vector as defined by any
        one of claims 1 through 16;

    (b) expressing the recombinant vector in an
        appropriate host to produce the peptide; and

    (c) collecting the peptide.



21. A method for identifying the presence of a nucleic
acid sequence which encodes at least a portion of 60 kD Ro
antigen, or a biologically functional equivalent thereof,
in a biological sample suspected of containing such a
sequence, the method comprising the steps of:

    (a) incubating nucleic acids from the biological
        sample with a DNA segment as defined by any one
        of claims 9 through 16, or with a nucleic acid
        segment as defined by any one of claims 17
        through 19, under conditions appropriate for the
        formation of specific hybrids; and

    (b) detecting the formation of specific hybrids
        between the nucleic acids and the segment by
        means of a label, the formation of such hybrids
        being indicative of the presence of such a
        nucleic acid sequence in the biological sample.



22. The method of claim 21 wherein the biological sample
comprises a recombinant host cell colony suspected of
containing a recombinant DNA sequence encoding at least a
portion of 60 kD Ro antigen.

23. The method of claim 21 wherein the biological sample
comprises isolated DNA.



24. A method of testing for the presence of anti-Ro
antibodies in a sample, the method comprising the steps
of:

    (a) preparing a peptide which includes at least an
        epitopic core sequence of 60 kD Ro antigen, or a
        biologically functional equivalent thereof, in
        accordance with claim 20; and

    (b) immunologically testing the sample for
        antibodies which cross react with the peptide.